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EXPERIMENTAL  STUDY  OF  THE  TILT  ANGULAR  ANISOPLANATIC  CORRELATION  AND  THE 
EFFECT  OF  STRATOSPHERIC  TURBULENCE  ON  STAR  IMAGE  MOTION 
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Abstract 

The  results  of  experimental  studies  of  the  effect  of  stratospheric  turbulence  on  star  image  motion  and  tilt  angular 
anisoplanatism  are  presented.  A  Polaris  jitter  experiment  was  designed  and  conducted  to  determine  the  effect  cf 
stratospheric  inhomogeneities  on  star  image  motion.  Polaris  permits  us  to  measure  star  image  motion  with  a  bolted 
telescope.  In  this  case  the  mechanical  telescope  motion  is  eliminated.  High  resolution  spatial  and  temporal  statistics 
of  the  Polaris  image  motion  were  obtained.  It  was  shown  that  the  dependencies  of  the  Polaris  jitter  variance  on  the 
telescope  diameter  have  a  knee.  This  knee  cannot  be  explained  based  on  the  Kolmogorov  model  for  the  atmospheric 
turbulence.  At  the  same  time,  the  knee  is  consistent  with  the  model  which  takes  into  account  the  effect  of 
stratospheric  turbulence.  The  dependence  of  the  power  spectra  of  the  Polaris  image  jitter  on  the  telescope  diameter 
was  also  studied.  It  was  determined  that  the  measured  spectral  slopes  deviate  from  the  theoretical  predictions.  In  the 
high  frequency  domain  the  /”8/3  and  /~9/3  behavior  was  observed,  whereas  the  /~11/3  behavior  is  predicted  by  the 
theory.  In  the  low  frequency  domain  the  measured  slope  is  close  to  -1,  whereas  the  slope  -2/3  follows  from  the 
theoretical  predictions.  The  dependence  of  the  knee  frequency  of  the  power  spectrum  on  the  telescope  diameter, 
nevertheless,  agrees  well  with  the  corresponding  formula.  The  tilt  temporal  correlation  scale  increases  proportionally 
to  the  telescope  diameter.  For  the  aperture  of  0.1  m  and  1.5  m  it  is  in  the  range  from  0.1  s  to  1  s.  To  study  the  tilt 
angular  anisoplanatism,  a  Moon  edge  jitter  experiment  was  designed  and  carried  out.  By  measuring  the  angular 
motion  of  small  portions  of  the  edge  of  the  Moon  at  different  angular  separations,  a  standard  deviation  of  the  lateral 
tilt  difference  was  determined.  The  preliminary  analysis  shows  that  the  measured  dependencies  of  the  tilt  difference 
standard  deviation  on  the  separation  differ  from  the  theoretical  predictions  and  exceed  the  corresponding  dependencies 
obtained  from  observations  of  the  stars.  A  cross  linking  effect  of  the  longitudinal  tilt  is  considered  as  a  possible 
reason.  A  spatial  filtering  technique  should  be  applied  to  the  measured  data  to  eliminate  this  effect  The  future 
development  of  the  above  studies  is  considered. 
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EXPERIMENTAL  STUDY  OF  THE  TILT  ANGULAR  ANISOPLANATIC  CORRELATION  AND  THE 
EFFECT  OF  STRATOSPHERIC  TURBULENCE  ON  STAR  IMAGE  MOTION 


Mikhail  S.  BelerTkii 

In  tr  oduc  tioo 

Adaptive  optics  systems  {1-3}  can  potentially  provide  us  with  a  diffraction  limited  resolution  for  the  large  aperture 
telescopes.  However,  the  inability  of  the  conventional  laser  guide  star  (LGS)  method  to  sense  a  global  tilt  across  the 
telescope  aperture  greatly  limits  the  area  of  the  sky  accessible  for  adaptive  optics  correction.  To  achieve  full  sky 
coverage,  a  tilt  sensing  technique  with  a  LGS  should  be  developed. 

The  technique  for  sensing  full  aperture  tilt  with  a  LGS  has  been  suggested  {4,5}.  This  technique  exploits  a  laser 
beam  transmitting  through  the  main  telescope  and  two  auxiliary  telescopes  separated  in  transverse  directions.  A  tilt 
angular  anisoplanatism  is  exploited  in  this  technique  to  eliminate  a  contribution  of  a  down  propagation  path  to  the 
measured  tilt  Due  to  the  fact  that  light  emitted  from  different  portions  of  a  laser  beacon  on  the  way  to  an  auxiliary 
telescope  passthrough  different  inhomogeneities,  a  LGS  image  is  wavy.  The  waviness  scale  is  determined  by  the 
tilt  angular  correlation  scale.  If  the  FOV  of  the  receiver  is  much  larger  than  the  tilt  angular  correlation  scale,  then  the 
contribution  of  a  down  propagation  path  might  be  averaged  out  by  averaging  the  LGS  image  motion  over  it’s 
angular  extent.  A  full  aperture  tilt  might  be  also  measured  by  using  a  small  aperture  beam  transmitting  from  behind 
a  portion  of  a  primary  mirror  {6}.  Simultaneous  observations  of  the  LGS  image  motion  with  both  a  main  and 
auxiliary  telescope  are  required  in  this  scheme. 

In  order  to  design  a  tilt  sensing  experiment  with  the  LGS  {4-6},  estimates  for  the  tilt  angular  correlation  scale  for 
the  site  are  needed.  Such  estimates  might  be  obtained  by  using  the  corresponding  theoretical  relationship.  However, 
that  requires  the  data  for  the  vertical  profile  of  C2(/i)  and  an  outer  scale  of  turbulence.  Unfortunately,  there  are  no 
reliable  data  for  the  outer  scale  available  in  the  literature.  To  overcome  this  difficulty  an  experimental  study  of  the 
tilt  anisoplanatism  was  suggested.  To  study  the  tilt  angular  anisoplanatism,  the  Moon  edge  jitter  experiment  was 
designed  and  conducted.  By  measuring  the  lateral,  or  transverse,  angular  motion  of  small  portions  of  the  Moon  edge 
image,  at  variable  separations  from  each  other,  a  standard  deviation  of  the  lateral  tilt  difference  was  determined.  This 
development  was  performed  as  a  part  of  this  project,  jointly  with  a  team  of  researchers  at  the  Starfire  Optical  Range 
(SOR)  at  Phillips  Laboratory,  Albuquerque,  NM. 

Another  important  phenomenon  which  might  affect  the  performance  of  adaptive  systems  is  the  effect  of  stratospheric 
turbulence  on  star  image  degradation.  The  theoretical  analysis  {7,8}  shows  that  this  effect  is  important  for  large 
aperture  telescopes.  To  verify  the  theoretical  predictions,  a  Polaris  image  jitter  experiment  was  designed  and  carried 
out  at  SOR. 
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Polaris  jitter  gxpgrinrent 

To  determine  the  effect  of  non-Kolmogorov  stratospheric  turbulence  on  the  star  image  motion,  a  Polaris  jitter 
experiment  was  designed  using  a  ground-based  telescope.  Polaris  was  chosen  for  this  study  for  the  following  reason. 

A  non-Kolmogorov  stratospheric  turbulence  produces  an  effect  on  the  star  image  motion,  which  is  similar  to  that  fix 
uncontrolled  motion  of  the  telescope.  In  order  to  isolate  the  atmospheric  motion,  the  telescope  jitter  should  be 
eliminated.  Because  of  the  slow  angular  motion,  Polaris  permits  us  to  make  measurements  with  a  bolted  telescope. 
In  this  case  the  mechanical  motion  is  removed. 

The  experiment  was  based  on  observations  of  the  Polaris  image  motion  by  using  a  64  x  64  CCD  (Lincoln  Lab) 
imaging  system  at  the  1.5  m  telescope.  The  five  aperture  masks  (D  =  0.1,  0.2,  0.5,  0.75,  and  1.5  m)  in  conjunction 
with  the  appropriate  ND  filters  to  keep  incident  flux  constant  and  to  achieve  maximum  flux  without  saturating  the 
CCD  camera  were  used  to  obtain  high  resolution  spatial  and  temporal  statistics  of  the  Polaris  image  jitter.  To  assure 
that  the  measured  data  are  not  affected  by  internal  turbulence  within  an  optical  path,  the  image  motion  of  an  internal 
source  was  measured  with  the  same  experimental  set  up.  It  was  shown  that  the  jitter  variance  of  the  internal  source  is 
more  than  one  order  magnitude  smaller  than  that  for  a  natural  star. 

The  following  observation  procedure  was  exploited.  The  telescope  views  Polaris  with  a  symmetrical  42  x  42  arcsec 
FOV,  yielding  resolution  of  0.66  arcsec/pixel.  The  telescope  is  first  pointed  toward  Polaris  and  bolted  in  position 
such  that  the  earth’s  rotation  causes  the  image  of  Polaris  to  track  horizontally  across  the  FOV  (at  approximately  1  \i 
rad/sec).  The  data  are  collected  with  a  frame  rate  of  200  Hz  over  100  s,  and  with  a  rate  of  400  Hz  over  50  s,  and 
stored.  The  K-mirror  is  used  to  ensure  the  stellar  image  tracks  along  one  CCD  channel  without  crossing  into  an 
adjacent  channel.  Each  data  run  consists  of  20,000  frames.  Next,  the  telescope  is  unbolted  and  pointed  at  Polaris, 
keeping  the  image  centered  as  data  is  collected.  Comparison  of  the  jitter  statistics  determined  from  the  two  data  sets 
permits  us  to  isolate  the  atmospheric  and  mechanical  jitter.  The  optimum  pixel  size  of  0.66  arcsec  was  determined 
by  using  a  numerical  simulation  procedure,  developed  by  Capt.  D.  Monterra.  This  pixel  size  presents  an  optimum 
tradeoff  between  sampling  error  and  read  noise. 

Data  collection  and  processing  procedures 

The  data  were  collected  during  four  nights  in  May,  June,  and  August,  1996.  The  14  data  sets  corresponding  to  both 
bolted  and  unbolted  telescope  operation  were  collected  and  stored.  For  each  experimental  session  the  meteorological 
data  including  wind  speed  and  direction,  coherence  diameter  (r0)  and  isoplanatic  angle  (0o)  were  recorded,  as  well. 
During  these  observations  a  coherent  diameter  r0  was  in  the  range  from  3  cm  to  8.5  cm. 

The  data  processing  procedure  included  the  following  steps:  1)  the  sky  background  was  subtracted  from  the 
measured  data,  and  the  correction  for  the  dome  flat  was  performed;  2)  the  two  one-axis  (X  and  Y)  coordinates  of  the 
centroid  position  were  determined  for  each  of  the  20,000  frames;  3)  the  trends  were  removed  from  the  time  series  for 
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the  cenroid  coordinates;  4)  the  Polaris  image  jitter  variance,  the  tilt  temporal  power  spectrum,  and  tilt  temporal 
correlation  coefficient  were  calculated  for  each  aperture  mask. 

Rs suits  obtained 

a)  Dependence  of  the  star  jitter  variance  on  the  telescope  diameter 

The  dependencies  of  one-axis  Polaris  image  jitter  variance  on  the  telescope  diameter  obtained  during  three  days  are 
presented  in  Figs.  la-c.  The  experimental  dependencies  are  shown  along  with  the  two  theoretical  predictions.  One 
prediction  corresponds  to  the  Kolmogorov  model  with  infinite  outer  scale  of  turbulence,  which  yields  the  well 
known  dependence,  o1  ~D~m{ 9}.  Another  prediction  corresponds  to  the  finite  outer  scale.  Even  though  the 
experimental  dependencies  deviate  from  the  theoretical  predictions,  a  larger  than  -1/3  slope  of  experimental  curves  for 
small  D  might  be  attributed  to  a  finite  outer  scale. 

According  to  both  predictions  based  on  the  Kolmogorov  model,  a  star  jitter  variance,  cr2,  should  gradually  decrease 
with  increasing  telescope  diameter.  However,  as  it  is  shown  in  Figs,  la-c,  the  experimental  dependencies  at 
D>  0.75  m  have  a  knee.  This  knee  cannot  be  explained  on  the  basis  of  Kolmogorov  model.  At  the  same  time,  it 
can  be  explained  by  taking  into  account  the  effect  of  stratospheric  turbulence.  Indeed,  according  to  {7,8},  the 
stratospheric  phase  structure  function  is  approximately  quadratic,  and  the  stratospheric  component  of  the  star  image 
motion  does  not  depend  on  the  telescope  diameter.  Due  to  this  fact,  the  dependencies  of  the  star  jitter  variance  on  the 
telescope  diameter  have  a  knee.  Because  the  telescope  mechanical  motion  in  these  measurements  was  eliminated, 
and  the  contribution  of  the  turbulence  within  the  optical  train  was  negligibly  small,  one  should  assume  an  influence 
of  inhomogeneities  with  the  spectrum  which  differs  from  the  Kolmogorov  model  to  explain  the  knee. 

b)  Tilt  Temporal  Power  Spectra 

The  temporal  power  spectra  of  one-axis  components  of  the  Polaris  centroid  were  calculated  from  the  time  series  for 
the  data  collected  during  the  four  nights  of  observation.  About  60  power  spectra  for  one-axis  tilt  components 
corresponding  to  both  bolted  and  unbolted  telescope  performance  and  five  aperture  masks  were  calculated.  For  the 
time  series  with  a  frame  repetition  rate  of  200  Hz  the  Nyquist  frequency  equals  100  Hz,  and  it  is  200  Hz  for  the  time 
series  with  a  frame  repetition  rate  of  400  Hz. 

According  to  Kolmogorov  model  and  frozen  turbulence  hypothesis,  the  tilt  temporal  power  spectrum  follows  the 
dependence  f~l]n  in  the  high  frequency  domain,  whereas  it  obeys  to  the  /~2/3  law  in  the  low  frequency  range  {10}. 
The  knee  frequency  of  the  power  spectrum  is  determined  by  the  equation  fk  =03(V  /  D),  where  V  is  a  wind  speed 
and  D  is  the  telescope  diameter.  At  the  knee  frequency,  fk,  a  slope-break  marks  the  separation  between  the  two 
regimes. 
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The  power  spectra  for  one-axis  tilt  component  for  D  =  0.1  m  and  D  =  1.5  m  for  unbolted  telescope  performance  are 
shown  in  Figs.  2a-b.  By  comparing  these  plots,  it  is  easy  to  see  that  the  spectra  strongly  depend  on  the  telescope 
diameter.  The  power  spectrum  for  the  small  aperture,  D  =  0.1  m,  exceeds  that  for  the  large  one,  D  -  1.5  m,  in  both 
low  and  high  frequency  ranges.  Such  a  behavior  corresponds  to  an  enlargement  of  the  Polaris  jitter  variance  with 
decreasing  the  telescope  diameter  (see  Figs.  la-c).  The  power  spectra  have  a  slope  which  is  close  to  -  8/3  =  -  2.66  in 
the  high  frequency  range,  and  the  slope  is  -  1  in  the  low  frequency  range.  The  knee  frequency  increases  with 
decreasing  the  telescope  diameter. 

The  two  dependencies  of  the  knee  frequency  on  the  telescope  diameter  measured  at  different  days  along  with  the 
theoretical  predictions  calculated  from  the  formula  /k  =  0.3(V/D)  are  presented  in  Fig.  3a  and  b.  The  knee  frequency 
for  the  smallest  aperture,  D  =  0.1  m,  in  the  first  case  is  equal  to  25  Hz  that  corresponds  to  the  wind  velocity  of  8.3 
m/s.  In  the  other  case  the  knee  frequency  is  45  Hz  corresponding  to  the  wind  speed  of  15  m/s. 

To  verify  the  fact  that  the  measured  data  are  not  affected  by  the  turbulence  within  the  optical  train,  the  temporal 
power  spectra  with  an  internal  source  were  also  measured.  The  power  spectra  for  the  internal  source  for  D  -  0.1  m 
and  1.5  m  are  shown  along  with  the  atmospheric  spectra  for  unbolted  telescope  performance  in  Fig.  4.  It  is  shown, 
that  the  power  spectrum  for  the  small  aperture,  D  =  0.1  m,  exceeds  by  almost  two  order  of  magnitude  that  for  the 
large  one,  D  =  1.5  m,  in  the  high  frequency  range.  Both  spectra  for  Polaris  exceed  that  for  the  internal  source  in  both 
low  and  high  frequency  ranges. 

It  is  also  seen  that  the  power  spectrum  for  a  D  =  1.5  m  aperture  and  unbolted  telescope  performance  have  a 
noticeable  bump  in  the  range  from  70  Hz  to  90  Hz.  This  bump,  however,  does  not  appear  in  the  spectra  for  the 
bolted  telescope  performance.  Therefore,  the  bump  is  associated  with  the  mechanical  telescope.  The  relative 
contribution  of  the  bump  to  the  Polaris  jitter  variance,  however,  is  negligibly  small. 

The  analysis  shows  that  the  majority  of  the  spectra  for  the  largest  aperture,  D  =  1.5  m,  have  a  slope  in  the  range 
from  -2.3  to  -2.6  with  a  mean  value  -2.45.  For  the  smallest  aperture,  D  =  0.1  m,  a  spectral  slope  varies  in  the  range 
from  -2.6  to  -3  in  the  high  frequency  domain,  and  it  is  close  to  -1  in  the  low  frequency  range.  The  implication  is 
that  the  measured  slopes  deviate  from  the  predictions  based  on  the  Kolmogorov  model  and  the  frozen  turbulence 
hypothesis.  In  particular,  the  spectral  slopes  close  to  -8/3  and  -9/3  were  observed  in  the  high  frequency  domain, 
whereas  the  slope  -11/3  was  predicted.  In  the  low  frequency  domain  the  slope  -2/3  was  predicted  by  the  theory, 
whereas  the  slope  -1,  was  observed.  It  should  be  noted  that  the  /“8/3  behavior  in  the  high  frequency  range  has  been 
also  observed  in  the  stellar  interferometric  measurements  {11}. 

There  are  some  speculations  that  the  theoretical  f~x  1/3  high-frequency  law  is  generally  not  observed  because  of  noise 
and  aliasing  effects  in  wave-front  sensing.  An  alternative  explanation  is  that  wind  velocity  fluctuations  and  turbulent 
inhomogeneities  evolution  might  affect  the  behavior  of  the  power  spectra.  Thus,  the  spectral  slopes  are  not  perfectly 
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consistent  with  the  theoretical  predictions.  An  additional  analysis  of  the  measured  data,  as  well  as,  a  more 
sophisticated  theoretical  model  is  needed  for  interpretation  of  the  above  measured  dependencies. 

c)  Tilt  Temporal  Correlation 

From  the  time  series  for  the  energy  centroid  the  tilt  temporal  correlation  coefficients  were  calculated.  The  correlation 
coefficients  were  calculated  separately  for  the  X  and  Y  centroid  for  each  aperture  size.  The  correlation  coefficients  fcr 
one-axis  tilt  components  corresponding  to  an  unbolted  telescope  performance  are  shown  in  Figs.  5a-b.  If  one 
determines  the  tilt  temporal  correlation  scale  as  the  time  lag  over  which  the  correlation  coefficient  decreases  to  the 
level  e'1,  then  it  is  shown  from  the  data  presented  in  Figs.  5a-b,  that  the  tilt  correlation  scale  for  the  largest  aperture 
is  about  1  s,  whereas  for  the  smallest  aperture  it  is  less  than  0.1  s.  It  is  seen  that  the  tilt  correlation  scale  increases 
proportionally  to  the  telescope  diameter. 

Conclusion 

To  verify  the  predicted  effect  of  the  stratospheric  turbulence  on  star  image  motion,  the  Polaris  image  jitter 
experiment  was  designed  and  carried  out.  Polaris  image  motion  was  measured  by  using  both  bolted  and  unbolted 
telescopes.  In  the  bolted  case  the  mechanical  telescope  motion  was  eliminated.  High  resolution  spatial  and  temporal 
statistics  of  the  Polaris  image  motion  were  obtained.  It  was  shown  that  the  dependencies  of  the  Polaris  jitter  variance 
on  the  telescope  diameter  have  a  knee.  The  knee  cannot  be  explained  based  on  the  Kolmogorov  model,  but  it  can  be 
explained  by  taking  into  account  the  effect  of  stratospheric  turbulence.  The  measured  spectral  slopes  deviate  from  the 
theoretical  predictions.  At  the  same  time,  the  dependencies  of  the  knee  frequency  on  the  telescope  diameter  agree 
well  with  the  corresponding  formula.  For  the  telescope  diameter  of  0.1  m  and  1.5  m,  respectively,  the  temporal  tilt 
correlation  scale  is  in  the  range  from  0. 1  s  to  1  s,  and  it  increases  proportionally  to  the  telescope  diameter. 

Future  development 

The  current  experimental  design  does  not  permit  us  to  measure  star  image  motion  with  different  aperture  masks 
simultaneously.  Due  to  this  fact,  the  data  can  suffer  from  rapid  variations  in  atmospheric  seeing  conditions.  This 
makes  a  comparison  of  the  jitter  variances  for  different  telescope  diameters  difficult.  A  modification  in  the  optical 
scheme  which  permits  us  to  record  the  star  images  for  the  five  aperture  masks  at  the  same  frame  has  been  suggested 
by  Capt.  S.  Karis.  The  influence  of  the  variations  of  the  atmospheric  seeing  conditions  in  a  new  scheme  is 
eliminated.  Measurements  of  the  Polaris  image  motion  with  a  modified  optical  scheme  at  a  3.5  m  telescope  are 
planned. 

Tilt  angular  correlation  and  tilt  anisoplanatism 

Let  us  consider  first  the  theoretical  predictions  for  the  tilt  angular  anisoplanatism. 
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Results  of  the  theoretical  analysis 

The  angular  correlation  of  the  tilts  for  two  plane  waves  arriving  at  the  telescope  of  diameter  D  at  the  angles  zero  and 
0  is  considered  by  using  the  Zemike  polynomials  expansion  for  the  atmospheric  wave-front  distortions  { 12-15}.  For 
the  von  Karman  power  spectrum  of  the  refractive-index  fluctuations  <t>,{K)  =  0.033C,(K2  +  K02Jt)’",\ where 


*o,  =  3  .08  /  Lo,  a  tilt  correlation  coefficient  has  the  form 


b,M  =  - 


iyhCl(h)£dK{K'  +  K10Ky"'tJl{K) 


IK 


J.  dhCl(h)  \;dK{K'  +  K'0')- w>Jl(K)/K 


(1) 


where  J0  and  J2  are  the  Bessel  functions.  The  negative  sign  in  Eq.  (1)  corresponds  to  the  longitudinal,  or  parallel  to 
the  separation  0,  tilt,  X,  whereas  the  positive  sign  corresponds  to  the  lateral  tilt,  Y.  The  tilt  correlation  coefficients 
calculated  by  using  Eq.  (1)  for  the  vertical  profile  Cj(h)  given  by  the  model  C\(h)  =  C20  exp(-A/£)  are  shown  in 

Fig.  6.  Here  h  is  the  effective  altitude,  or  characteristic  scale,  of  the  turbulent  atmosphere  determined  by  the  ratio 
h  =  jr,  / /i,  of  the  first  two  moments  for  the  vertical  profile  Cj (h) :  n,  =  j~ Cl(h) dh  and  /i,  =  J”  C* ( h ) hdh .  If  the 

parameters  C*0  and  h  are  equal  to  Cjg  =  2.37  x  10"15m"J/5  and  h  =  934  m,  respectively,  the  two  moments  p0  and  |i, 
for  the  above  C2(/i)  model  coincide  with  that  for  the  Hufnagel-Valley  (HVy,)  model  { 16,17}.  For  the  HV-54  model 
these  parameters  equal  C20  =  8.55  x  10_l6m_2/3  and  h  =  3,466  m.  The  tilt  structure  function,  or  tilt  anisoplanatism, 
Dy(9) ,  is  related  to  the  tilt  angular  correlation  coefficient  b}(6)  through  the  equation 

D,(6)  =  ([a,(0)-a,m2)  =  2(tJ(L0  /  D)[l-by(0/G,)],  (2) 

where  oy  is  a  one-axis  tilt  variance  depending  on  the  ratio 4 ID.  Therefore,  one  can  calculate  the  tilt 


anisoplanatism  by  knowing  the  correlation  coefficient  by{6)  and  the  variance  cr,2 . 


From  Fig.  6  it  is  shown  that  the  tilt  angular  correlation  gradually  decreases  with  increasing  angular  distance,  0.  For 
small  separation  ( 9  «  9, )  it  approaches  die  asymptote  bir(G)  -  1  -(9 1 9,f .  For  large  angular  separation  (0  »  9 ) 

and  for  the  Kolmogorov  model  with  Z,  =  the  asymptote  has  the  form  (9  /  9,)~m .  For  a  finite  outer  scale,  £,,  and 
large  angular  separation,  0,  the  correlation  coefficients  for  the  lateral  and  longitudinal  tilt  have  the  asymptotes, 
(0/0,) ',  and,  (0/0,) 2 ,  respectively.  Therefore,  the  asymptotes  and  the  tilt  angular  correlation  coefficient  behavior 
for  the  finite  and  infinite  outer  scale  are  different  The  implication  is  that  the  Kolmogorov  model  with  infinite  outer 
scale  (Lq  =  »)  does  not  permit  us  to  describe  the  tilt  angular  correlation. 

One  can  introduce  the  tilt  angular  correlation  scale,  0, ,  as  the  distance  between  the  propagation  paths  over  which  the 
correlation  coefficient,  ft, ,  decreases  to  the  level  e~l(by(9,)  =  e~l).  From  the  numerical  estimates  for  by(9)  presented 
in  Fig.  6,  it  is  easy  to  find  that  the  tilt  correlation  scale  0,  is 

0,  =a(L0l  D)D/h ,  (3) 


1-8 


where  a  is  the  coefficient  of  proportionality.  This  coefficient  describes  the  effect  of  the  outer  scale  of  turbulence, 
on  the  tilt  correlation  scale.  When  o  =  constant,  the  tilt  correlation  scale,  0, ,  is  proportional  to  D,  however,  due  to 
the  dependence,  a(L0  /  D ),  the  tilt  correlation  scale  0,  varies  with  the  telescope  diameter  D  non-linearly.  The  latter 
is  seen  in  Table.  1,  where  the  estimates  for  the  tilt  correlation  scale,  0, ,  for  different  models  of  C\(h)  vertical  profile 
and  outer  scale  are  presented. 


Table  1.  Tilt  angular  correlation  scale,  6t  ,  arcsec 

Telescope  diameter,  D,  m 

0.1 

0.2 

0.3 

0.5 

1.5 

3.5 

HV-54, 

h  =3,466  m 

Lo=3  m 

13 

21 

26 

33 

69 

i39 

Lo=  6  m 

17 

27 

35 

46 

87 

172 

HV5/7, 

h  =  934  m 

L0=3  m 

48 

78 

96 

122 

255 

5 1 6 

L0=6  m 

63 

100 

130 

170 

323 

638 

The  correlation  scale,  0,,  increases  with  increasing  both  the  telescope  diameter  D,  and  the  outer  scale  1^.  The 
correlation  scale,  6t ,  is  smaller  for  the  model  HV-54  than  that  for  the  model  HVy,.  The  later  is  due  to  the  fact  that 

the  HV-54  model  contains  stronger  turbulence  at  high  altitudes  than  the  model  HV5/7.  The  high  altitude  turbulence 
more  severely  degrades  the  tilt  angular  correlation  than  that  near  the  ground.  The  implication  is  that  one  can  expect 
the  tilt  correlation  scale  to  be  smaller  at  the  sites  with  good  seeing  conditions  than  that  at  the  sites  with  bad  seeing. 
In  Fig.  7  the  tilt  correlation  coefficient  by(6)  is  plotted  versus  6  /  Qt  representing  a  universal  tilt  correlation 
curve. 

Experimental  study 

To  study  the  tilt  angular  correlation  and  tilt  anisopalantism,  the  Moon  edge  experiment  was  designed  and  carried 
out.  In  this  experiment  the  lateral,  or  transverse  to  the  image,  motion  of  the  small  portions  of  the  Moon  edge  was 
measured  by  using  a  high  resolution  CCD  camera.  By  selecting  the  two  portions  of  the  Moon  edge  image  at 
different  angular  separations,  the  tilt  structure  function,  or  standard  deviation  of  the  tilt  difference,  was  determined. 
Special  attention  in  this  experiment  was  paid  to  the  dependence  of  the  tilt  anisoplanatism  on  the  aperture  size.  For 
this  purpose  the  observations  were  performed  with  the  five  aperture  masks  (D  =  0.1,  0.2,  0.5,  0.75,  and  1.5  m). 

The  experimental  set  up  for  the  Moon  edge  experiment  was  similar  to  that  for  the  Polaris  jitter  experiment.  A  64  x 
64  CCD  camera  with  angular  pixel  size  of  0.66  arcsec,  yielding  42  x  42  arcsec  FOV  was  used.  The  K-mirror  was 
exploited  to  ensure  the  Moon  edge  be  parallel  to  one  CCD  pixel  row.  The  data  were  collated  with  a  frame  rate  cf 
200  Hz  during  100  s,  and  with  a  frame  rate  of  400  Hz  during  50  s.  The  corresponding  exposure  times  are  equal  5  ms 
and  2.5  ms,  respectively.  Each  data  run  consists  of  20000  frames. 
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The  data  processing  procedure  included  several  steps.  First,  the  sky  background  was  subtracted  and  a  flat  field 
correction  was  performed.  After  that,  the  Moon  edge  position  was  calculated  for  each  column  in  each  Game  by  using 
a  median  algorithm.  Then,  a  standard  deviation  of  the  tilt  difference  as  a  function  of  angular  separation  between  the 
column  was  calculated  from  the  sequence  of 20,000  frames  for  die  five  aperture  masks. 

The  preliminary  results  are  shown  in  Fig.  8.  The  standard  deviations  of  the  differential  centroid  motion  measured 
with  a  2.4  m  telescope  {18}  are  also  presented  at  the  same  plot.  The  latest  results  were  obtained  by  measuring  a 
differential  motion  of  the  centroids  of  closely  spaced  pairs  of  stars  visible  in  the  60  arcsec  wide  FOV. 

It  is  easy  to  see  that  the  behavior  of  the  experimental  curves  1-3  for  small  angular  separations  differs  from  the 
theoretical  prediction.  According  to  the  asymptote  for  the  tilt  correlation  coefficient,  bt  ,(8)  -  1  ~(0/6,f ,  the  tilt 

structure  function  for  small  separations  should  obey  to  the  square  power  law:  Dy(d)  =  (d/6tf.  Therefore,  the  tilt 

difference  standard  deviation  should  be  proportional  to  the  angular  separation.  However,  the  experimental  curves  1-3 
do  not  follow  this  prediction.  Besides,  the  curves  1-3  exceed  the  corresponding  standard  deviations  obtained  from 
the  stellar  observations. 

A  possible  reason  for  the  above  discrepancies  is  the  influence,  or  cross  linking  effect,  of  the  longitudinal  tilt  on  the 
measurements  of  the  lateral  tilt.  Because  the  atmospheric  motion  is  two-dimensional,  and  the  jitter  rms  is 
comparable  with  a  pixel  size,  a  longitudinal  tilt  might  result  in  displacement  of  the  portion  of  the  Moon  edge  image 
from  one  column  to  the  other.  Due  to  this  fact,  an  extra  lateral  tilt  is  determined  by  the  median  algorithm.  The  cross 
linking  effect  leads  to  enlargement  of  the  tilt  structure  function  and  the  standard  deviation. 

To  eliminate  the  cross  linking  tilt  effect,  a  spatial  filtering  technique  should  be  applied  to  the  sequence  of  frames 
with  the  Moon  edge  images,  or/and  the  angular  size  of  the  smallest  portion  of  the  edge  image  should  be  enlarged  by 
banding  the  neighboring  columns.  In  any  case,  a  more  detailed  analysis  of  the  full  set  of  data  by  using  the  algorithm 
for  elimination  of  the  cross  linking  effect  should  be  performed. 

Conclusion 

The  Moon  edge  experiment  was  designed  and  carried  out  to  study  the  tilt  angular  anisoplanatism.  The  data  woe 
collected  and  processed.  The  comparison  with  the  measurements  of  a  differential  motion  of  the  centroids  of  closely 
spaced  pairs  of  stars  and  the  theoretical  prediction  was  made.  The  comparison  shows  that  the  measured  dependencies 
deviate  from  that  obtained  from  observations  of  the  stars  and  differ  from  the  theoretical  predictions.  A  cross  linking 
tilt  effect  is  considered  as  a  possible  reason  for  these  deviations.  Further  analysis  of  the  measured  data  is  planned  to 
be  performed  by  exploiting  the  filtering  algorithm  for  elimination  of  the  tilt  cross  linking  effect. 
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Fig.  2  a  and  b.  Polaris  image  jitter  power  spectra  for  the  telescope 
diameter  of  D  =  0.1  m  (a)  and  D  =  1.5  m  (b). 
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Fig.  3a  and  b.  Dependence  of  the  knee  frequency  on  the 
telescope  diameter  measured  in  two  days. 
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Fig.  4.  Temporal  power  spectra  for  Polaris  (curves  1  and  2)  and  an  internal 
source  (curves  3  and  4).  Curves  1  and  3  correspond  D  =  0.1  m  and  curves  2  and 
4  correspond  to  D  =  1.5  m. 
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Fig.  5a  and  b.  Temporal  correlation  coefficient  for  one-axis  tilt 
components.  Curves  1,  2,  3,  4,  and  5  corresponds  to  D  =  0.1,  0.2,  0.5,  0.75, 
and  1.5  m,  respectively. 


0/(D/h) 

Fig.  6.  Angular  correlation  coefficient  for  the  lateral  and 
longitudinal  tilts  on  the  vertical  propagation  path.  Curves  1-4 
correspond  to  the  lateral  tilt,  and  curves  5  and  6  correspond  to  the 
longitudinal  one.  Ratio  L/D  is  equal  to  infinity  for  curve  1,  Lj/D  = 
50  for  curve  2,  L/D  =  10  for  curves  3  and  5,  and  L/D  =  3  for 
curves  4  and  6. 


Fig.  7.  Universal  dependence  of  the  tilt  correlation 
coefficient  on  the  angular  separation. 
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Fig.  8.  Dependence  of  the  standard  deviation  of  the  tilt  difference  on 
the  angular  separation.  Curves  1,  2,  and  3  correspond  to  D  =  0.1,  0.5, 
and  1.5  m,  respectively.  Squares  show  the  standard  deviations  of  the  centroid 
motion  obtained  from  observations  of  the  stars  with  a  2.4  m  telescope  {18}. 
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-  ABSTRACT  - 

This  report  examines  a  specific  fiber-optic  code  division  multiple  access  (O-CDMA)  digital 
communication  system.  Performance  results  on  data  detection  for  different  settings  of  various 
system  parameters  are  presented.  Previous  work  in  this  area  produced  preliminary  performance 
results,  however  the  need  for  further  investigations  became  evident.  The  simulation  results 
presented  involve  a  system  accommodating  multiple  users,  operating  in  both  synchronous  (same 
bit  rate  per  user)  and  asynchronous  (different  user  bit  rates  with  unrelated  clocks)  modes.  The 
effect  of  various  system  parameters  and  operating  conditions,  such  as  data  rate,  receiver  filter 
characteristics,  photodiode  efficiency  factor,  and  noise  induced  distortion  have  been  investigated. 

The  report  focuses  on  describing  the  simulation  system  implemented  and  presenting  the  results  of 
an  exhaustive  evaluation  of  system  performance  as  a  function  of  the  many  parameters  that  can 
vary  in  practice.  The  primary  emphasis  centers  on  demonstrating  the  appropriate  bit  stream 
recovery  for  each  of  the  users  having  access  to  the  system  under  different  operational  scenarios. 

Implementation  (in  software)  of  a  novel  Optical  Orthogonal  Code  (OOC),  used  as  an  encoding 
mask  for  the  on-off  modulating  carrier  waves  at  set  frequencies,  proved  to  be  successful  in 
reducing  crosstalk  in  systems  accommodating  multiple  users.  A  DSP  simulation  package  named 
“Signal  Processing  Workbench”  (SPW)  was  used  to  obtain  the  computer  generated  performance 
results.  SPW  offers  a  block  oriented  approach  to  simulating  the  complex  structure  of  the 
communication  system  under  study.  By  using  standard  library  blocks  or  creating  user  specified 
blocks  almost  any  system  can  be  thoroughly  evaluated,  subject  to  processor  limitations  (i.e. 
memory,  simulation  run  time,  etc.). 

General  conclusions  that  can  be  made  based  on  the  computer  simulations  performed  are  as 
follows.  High  data  rates,  (higher  than  100  KHz  for  typical  parameter  values  in  the  simulation)  will 
result  in  errors  in  decoded  binaiy  data  outputs.  Furthermore,  the  filter  parameters  used  to  separate 
the  individual  frequency  components  in  the  received  signal,  require  delicate  adjustment  in  order  to 
capture  the  exact  signal  energy  level  for  each  frequency  component.  It  was  found  that  leakage 
light  in  the  sources  used  to  generate  the  lightwave  signals,  needs  to  be  suppressed  20  dB  below 
the  light  transmitting  energy  level,  in  order  to  guarantee  reasonable  average  error  rates.  Specific 
simulations  show  that  white  Gaussian  noise  added  to  the  on-off  keyed  modulated  signals,  results 
in  errors  in  the  decoded  user  data  for  Signal  to  Noise  Ratios  below  approximately  6  dB. 
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-  INTRODUCTION - 

Due  to  the  explosive  growth  of  information  transmission  in  the  last  20  years,  the  need  for  high 
speed  communication  systems  is  evident.  Computer  networks,  either  local  or  wide  (i.e.,  the 
internet)  are  well-known  examples  of  transmission  systems  requiring  high  capacity,  multi- 
user/multi-access  capability  and  high  data  rates/throughput.  Lightwave  technology  developments 
have  made  it  possible  to  develop  systems  with  such  characteristics.  An  overview  of  the 
development  of  lightwave  communication  technology  may  be  found  in  [1],  Studies  in  the  field  of 
lightwave  communication  networks  with  a  goal  of  increasing  transmission  speed  and  the  number 
of  simultaneously  active  users  while  at  the  same  time  offering  reliable  overall  system  performance, 
have  led  to  a  better  understanding  of  physical  limitations  imposed  by  optical  and  non-optical 
components. 

In  an  optical  communication  system,  information  is  transmitted  by  light  propagation  inside  a  glass 
fiber  in  the  form  of  a  coded  sequence  of  optical  pulses.  The  technique  known  as  Optical  Code 
Division  Multiple  Access  (O-CDMA)  provides  a  means  of  developing  a  reliable  high  speed  multi¬ 
user  communication  system.  There  are  other  techniques  for  developing  such  communication 
systems,  however  a  detailed  explanation  of  the  various  techniques  used,  is  beyond  the  scope  of 
this  report.  A  preliminary  investigation  into  the  possibility  of  carrying  out  computer  simulations  of 
an  O-CDMA  communication  system  has  been  performed.  The  results  of  this  system  performance 
investigation,  (reported  in  [4]),  were  used  as  a  starting  point  for  this  research  project. 

Recovery  (i.e.,  decoding  of  the  user  signals  at  the  input  of  the  receiver)  of  encoded  binary 
information  in  the  O-CDMA  communication  system,  can  be  performed  in  several  ways,  as 
described  in  an  excellent  treatment  on  the  subject,  [5], 
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Recovery  of  the  binary  information  in  the  O-CDMA  digital  communication  system  under  study  is 
based  on  decisions  made  against  set  threshold  levels  and  implemented  with  photodetectors  and 
counters.  In  previous  work  it  was  found  that  the  use  of  bipolar  codes  (as  opposed  to  unipolar 
codes)  in  the  O-CDMA  communication  system  under  study  is  fundamental  to  its  practical 
operation.  As  explained  in  Section  5  of  the  report,  the  system’s  threshold  for  unipolar  coded 
systems  was  found  to  be  dependent  on  the  number  of  active  users.  In  practice,  the  number  of 
active  users  at  a  certain  time  can  only  be  determined  if  there  is  some  sort  of  network  control 
present.  This  would  impose  significant  design  challenges  as  in  typical  O-CDMA  communication 
environments,  the  network  controller  would  have  to  monitor  the  number  of  users  and  their 
characteristics. 

Ideally  the  O-CDMA  system  should  allow  random  access  to  an  indefinite  number  of  users, 
without  having  the  communication  channel  subdivided  into  time  or  frequency  slots.  Additional 
users  subscribing  to  the  system,  can  be  assigned  unique  codes  and  then  given  channel  access 
without  the  need  to  synchronize  with  any  of  the  other  users.  This  is  the  essence  of  the  system  that 
has  been  investigated  via  computer  simulations  under  various  operational  scenarios. 

The  advantage  of  assigning  unique  codes  to  each  user  is  the  security  provided  against  possible 
attempted  eavesdropping  by  unauthorized  parties,  to  whom  the  signal  will  be  unintelligible,  unless 
the  code  is  known.  (A  detailed  explanation  on  the  use  of  coding  may  be  found  in  Section  3  in  this 
report.)  Some  theoretical  background  on  the  generation  of  the  codes  used  in  this  system 
simulation  study  can  be  found  in  [6],  [7]  as  it  relates  to  a  maximal  length  PN-code  and  in  [8]  for 
the  Walsh  code.  A  detailed  presentation  on  the  characterization  and  performance  of  optical  codes 
in  a  CDMA  environment  can  be  found  in  [9]  and  [10],  whereas  [11]  deals  with  an  analytical 
approach  to  describing  the  performance  of  O-CDMA  systems. 


- 1.  THE  O-CDMA  SYSTEM  -  GENERAL  PRINCIPLES  - 

Optical  code  division  multiple  access  (O-CDMA)  communications  provides  a  means  to  combine 
the  vast  bandwidth  available  in  direct  detection  optical  systems  with  the  advantages  of  spread 
spectrum  multiplexing.  Non-Optical  CDMA  techniques  utilize  spread  spectrum  radio  frequency 
modulation  techniques  based  on  the  assignment  of  individual  spreading  codes  to  each  of  the  users. 
By  means  of  encoding,  the  binary  user  signals  in  a  CDMA  digital  communication  system  are 
spread  in  the  frequency  domain  so  that  each  user  occupies  the  entire  bandwidth  available  resulting 
in  total  bandwidth  utilization.  The  optical  medium  is  superbly  suited  for  spread  spectrum  multiple 
access,  due  to  its  extremely  large  bandwidth.  In  fact,  because  of  the  limited  speed  of  electronic 
signal  processors,  the  process  of  optical-to-electrical  and  electrical-to-optical  conversion  in  a  fiber 
optic  network  limits  how  much  fiber  bandwidth  can  be  used.  Once  fully  developed  and  integrated, 
optical  components  will  offer  much  higher  speeds  for  optical  signal  processing  than  their 
electronic  counterparts.  Due  to  these  advantages,  CDMA  has  been  the  focus  of  much  research  in 
the  last  twenty  years,  primarily  in  the  radio  frequency  domain,  but  also  in  the  optical  domain. 
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O-CDMA  is  a  bandwidth  utilization  scheme  in  which  many  users  access  a  common  fiber  optic 
channel  simultaneously,  synchronously  or  asynchronously  through  the  use  of  data  encoding.  This 
multi-access  system  does  not  use  time  or  frequency  allocation,  and  users  may  transmit  without  the 
delays  inherent  in  multi-access  protocols.  The  O-CDMA  scheme  takes  advantage  of  excess 
bandwidth  in  single  mode  fibers  in  order  to  achieve  the  random  multi-user  communication  access, 
essentially  free  of  network  control. 

A  typical  O-CDMA  system  is  best  represented  by  an  information  data  source,  followed  by  an 
optical  encoder  containing  a  laser  or  some  other  optical  light  source,  when  the  information  is  an 
electrical  signal.  An  optical  encoder  is  used  to  map  the  electrical  binary  information  into  optical 
sequences.  These  optical  signals  are  then  coupled  onto  a  single  mode  fiber  optic  channel.  At  the 
receiving  end  of  the  O-CDMA  system,  the  optical  pulse  sequences  are  decoded  and  converted 
back  to  their  electrical  form.  Figure  1  shows  a  typical  O-CDMA  system  configuration. 


Fig.  1 A  fiber-optic  communication  system. 

The  optical  pulse  sequences  can  be  decoded  in  two  different  ways.  One  way  involves  a 
comparison  of  the  received  sequence  to  a  stored  replica  of  itself  (correlation  process),  followed  by 
a  threshold  level  comparator.  Another  way  to  decode  the  optical  sequences  is  to  use  a 
combination  of  filters  centered  around  the  different  center  frequencies  as  set  in  the  encoder,  in 
combination  with  a  photodetector  and  counter. 

In  a  typical  O-CDMA  there  are  “N”  such  transmitters  (user  data  sources)  and  receiver  pairs. 
Figure  2  shows  the  block  diagram  of  a  typical  O-CDMA  system. 
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Fig.  2  Block  diagram  of  a  fiber  optic  code  division  multiple  access  (FO-CDMA)  network 
in  a  star  configuration. 

To  send  information  from  one  user  to  another  in  an  O-CDMA  digital  communication  system,  the 
address  code  for  the  intended  receiver  is  impressed  upon  the  data  from  the  transmitting  data 
source.  One  of  the  principles  of  O-CDMA  signals  involves  the  ability  to  extract  the  data  from  the 
received  optical  signal  in  the  presence  of  all  other  users’  signals.  This  means  that  at  every  receiver 
the  appropriate  data,  sent  by  the  other  half  of  the  receiver-transmitter  pair,  needs  to  be  decoded 
correctly  even  under  the  very  likely  scenario  in  which  multiple  user  receiver-transmitter  pairs  are 
active  and  transmitting  at  the  same  time. 

Possible  interference  between  signals  from  each  of  the  users  in  the  single  mode  fiber  optic 
transmission  channel  therefore  needs  to  be  minimized  in  order  to  reduce  “crosstalk”  between  the 
users  and  thus  errors  in  the  recovered  data.  This  will  be  examined  in  greater  detail  in  Section  7.2 
of  the  report. 


-  2.  TRANSMISSION  IMPAIRMENTS  - 

There  are  unfortunately  numerous  sources  of  transmission  impairments  in  lightwave 
communication  systems.  The  different  impairments  that  are  of  importance  in  the  design  of  O- 
CDMA  systems  will  be  discussed  in  this  section. 


2.1  Inter  symbol  interference 

Inter  symbol  interference  (ISI)  is  caused  by  signal  dispersion.  Dispersion  can  be  viewed  as  the 
effect  in  which  signal  pulses  are  delayed  with  the  length  of  the  delay  in  the  received  signal  varying 
with  the  signal  frequency. 
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ISI  refers  to  those  impairments  that  distort  and  broaden  the  width  of  signal  pulses,  resulting  in  self 
interference  in  the  individual  user  signals.  This  effect  will  limit  the  maximum  bit  rate  that  can  be 
used  over  the  transmission  channel.  This  can  be  seen  in  Figure  3  where  the  broadening  of  the 
lightpulses  is  shown  as  well  as  its  resulting  interference. 
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Fig.  3  Inter  symbol  interference  (ISI). 

The  signals  “a”  and  “b”  in  Figure  3  show  the  same  information  content  of  a  binary  sequence.  The 
logic  “1”  in  this  figure  implies  the  presence  of  light  in  the  fiber,  whereas  the  logic  “0”  implies  the 
absence  of  light.  Signal  “a”  represents  the  ideal  situation  where  light  pulses  have  zero  rise  and  fall 
time.  Signal  “b”  in  Figure  3  represents  the  more  realistic  situation  in  an  optical  fiber.  The 
broadening  of  the  lightpulses  may  cause  signal  overlap  between  individual  light  pulses  if  the  data 
rate  is  sufficiently  high.  This  effect  (ISI)  can  be  seen  between  tj  and  t2  as  well  as  t3  and  t4,  causing 
a  logic  “0”  to  be  received  as  a  logic  “1”.  If  the  effect  is  too  severe,  data  errors  will  occur  forcing 
data  rates  to  be  reduced.  This  means  that  the  time  interval  between  pulses  (i.e.  the  distance 
between  time  positions  t3  and  t4  in  Figure  3)  will  be  larger  and  overlapping  pulses  will  be  less 
likely  to  occur. 

The  broadening  of  the  lightwave  pulses  is  caused  by  physical  effects  known  as 

1 .  Chromatic  dispersion 

2.  Polarization  dispersion 

3.  Laser  nonlinearities 

These  three  effects  are  briefly  discussed  next. 


2.1.1  Chromatic  dispersion 

This  effect  is  caused  by  material  dispersion  in  the  fiber  optic  transmission  medium,  because  of 
slight  impurities  in  the  (atomic)  structure  of  the  glass  fiber.  The  large  increase  in  the  use  of  lasers 
in  communication  has  partly  come  about  as  result  of  improvements  in  the  quality  of  the  medium 
through  which  light  energy  is  transmitted.  (An  excellent  treatment  on  the  historical  development 
of  different  optical  fibers  can  be  found  in  [1].) 
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The  objective  of  most  lightwave  transmission  systems  is  to  maximize  the  bit  rate-distance  product 
(BL).  The  bit  rate  ( B )  is  limited  inherently  by  chromatic  dispersion  in  the  fiber,  which  is 
responsible  for  the  broadening  of  optical  pulses  during  their  propagation  inside  a  single-mode 
fiber.  The  maximum  distance  ( L ),  a  bit  stream  can  be  transmitted  through  an  optical-fiber  before  a 
repeater  (amplifier)  is  necessary  on  account  of  signal  loss,  also  is  a  very  important  parameter 
associated  with  the  optical-fiber. 

The  choice  of  operating  wavelength  is  related  to  the  loss  and  dispersion  characteristics  of  the 
fiber.  In  order  to  develop  a  sense  of  the  properties  of  single-mode  fibers  (as  would  be  used  in 
practical  realizations  of  the  simulated  O-CDMA  digital  communication  system),  Figures  4  and  5 
are  provided.  Figure  4  shows  a  diagram  of  measured  loss  as  a  function  of  wavelength  for  single¬ 
mode  silica  fibers.  The  arrows  in  Figure  4  indicate  the  wavelength  regions  used  for  optical  fiber 
communications.  Figure  5  shows  the  material  dispersion  for  typical  single-mode  silica  fibers.  Data 
points  for  three  different  fibers  are  shown. 


Fig.  4  Loss  in  a  single-mode  silica  fiber  as  a  function  of  wavelength  [2 ]. 


Fig.  5  Typical  material  dispersion  for  three  different  silica  fibers  [2]. 
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As  can  be  seen  from  these  figures,  minimum  fiber  loss  occurs  at  a  wavelength  near  1.55  [J.m. 
Modem  optical  communication  systems  use  lasers  operating  at  or  near  this  wavelength.  However 
the  zero  dispersion  wavelength,  which  is  at  about  1.3  //m,  is  different  from  the  minimum  loss 
region  wavelength.  It  is  however  possible  to  shift  the  zero  dispersion  wavelength  towards  the 
desirable  1.55  /j,m  region,  by  modifying  the  fiber  characteristics  [2], 


2.1.2  Polarization  dispersion 

Dispersion  can  be  polarization  dependent.  Two  signals  of  the  same  frequency,  but  with  opposite 
polarizations,  will  experience  different  delays. 


2.1.3  Laser  nonlinearities 

The  first  two  physical  effects  causing  dispersion,  increase  in  severity  with  increasing  transmission 
distances.  The  amount  of  delay  difference  between  lightwaves  of  different  frequencies  due  to 
chromatic  and/or  polarization  dispersion,  becomes  larger  with  increasing  transmission  distance. 
Laser  nonlinearities  cause  dispersion,  independent  of  transmission  distances.  Laser  nonlinearities 
increase  with  aging  of  the  laser.  The  lasers  used  to  modulate  data  do  not  produce  pure  signals  of 
an  exact  frequency.  The  spectrum  of  a  laser  signal  exhibits  a  certain  small  bandwidth  called  the 
linewidth.  Therefore  the  pulse  sequences  have  finite  linewidths. 

To  understand  the  role  of  the  spectral  width  AAS  of  the  laser  pulse,  it  is  known  that  during  its 
propagation  inside  a  fiber  of  length  L,  an  optical  pulse  broadens  and  its  width  increases  by  an 
amount  given  by 


At  =  Df  (AX)  L 


Eq.  1 


In  Eq.  1,  Df  is  the  fiber  dispersion  coefficient  having  a  value  of  ~  17  ps/(knvnm)  at  a  wavelength 
of  1 .55  //m  (see  Fig.  5).  In  order  for  the  pulse  to  arrive  within  its  assigned  time  slot,  the  pulse 
should  not  be  wider  than  R1 ,  B  being  the  bit  rate  in  Gb/s,  i.e., 

BA x  =  BLDf  (AX)  <1  Eq.  2 

A  typical  spectral  width  (A A)  for  an  InGaAsP  laser,  operating  on  1.55  iim  is  about  5  nm.  Using 
realistic  values  for  Df  and  AAS,  it  is  possible  to  calculate  the  bit-rate  distance  product,  BL.  The  BL- 
product  is  limited  to  less  than  1 0  Ghz*km.  This  means  that  a  repeater  spacing  in  a  transmission 
medium  of  100  km  will  limit  the  bit-rate  in  the  fiber  to  100  Mb/s.  This  is  considered  to  be  small  in 
today’s  long  haul  lightwave  transmission  systems. 
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2.2  Photodetector  nonlinearity 

Signal  dispersion  is  one  of  the  impairments  in  the  O-CDMA  digital  communication  system. 
Additional  impairments  involve  the  use  of  photodetectors  in  the  direct  detection  receiver.  A  direct 
detection  O-CDMA  digital  communication  system  uses  photodetectors  in  combination  with 
bandpass  filters.  The  output  of  the  photodetectors  is  a  measure  of  energy  for  signals  of  a  certain 
frequency  as  determined  by  the  bandpass  filters.  (A  thorough  discussion  of  the  working  details  of 
the  simulated  O-CDMA  digital  communication  system  is  presented  in  Section  6  of  this  report). 

The  number  of  photons  that  arrive  at  the  photodetector  are  counted,  used  as  a  measure  of  signal 
strength  and  converted  into  an  electric  current.  The  efficiency  of  this  whole  process  of  transferring 
light  energy  to  electric  current  is  however  not  ideal.  The  number  of  photons  counted  when  light 
impinges  on  a  photodetector  is  known  to  be  random  and  Poisson  distributed. 


2.3  Noise 

Noise  in  the  received  signal  can  consist  of  shot  noise,  thermal  noise,  laser  phase  noise,  or 
amplified  spontaneous  emission  noise  (ASE).  The  latter  noise  is  introduced  in  O-CDMA  systems 
through  the  use  of  optical  amplifiers.  Optical  amplifiers  are  however  not  used  in  the  O-CDMA 
digital  communication  system  under  study.  Shot  noise  is  introduced  by  the  photodetectors  and 
involves  photons  whose  number  is  Poisson  distributed.  The  random  variation  in  the  phase  of 
signals  transmitted  by  lasers  is  referred  to  as  phase  noise.  This  phase  noise  results  in  broadening  in 
the  linewidth  of  the  output  laser  spectrum.  This  results  in  dispersion  of  the  signals  in  the  O- 
CDMA  system  as  already  discussed.  Thermal  noise  is  introduced  mainly  by  the  receiver  pre¬ 
amplifiers  and  is  of  additive  Gaussian  form. 


-  3.  USER  CODE  MASKS  - 

There  are  several  reasons  why  binary  information  is  modulated  by  code  masks.  One  of  the  reasons 
is  that  the  information  sent  is  not  supposed  to  be  disturbed  or  read  by  unintended  users.  To  be 
sure  that  only  the  user  addressed  receives  the  information  that  has  been  sent,  every  user  needs  to 
be  assigned  a  unique  code.  The  codes  to  be  designed  have  to  meet  two  conditions: 

1.  each  sequence  should  be  easily  distinguished  from  shifted  versions  of  itself  and 

2.  each  sequence  should  be  easily  distinguished  from  versions  of  every  other  assigned  sequence. 
Codes  developed  with  these  requirements  that  meet  orthogonality  conditions  are  called  nearly 
Optical  Orthogonal  Codes.  In  this  section  the  kind  of  code  masks  used  in  this  investigation  will  be 
explained. 
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3.1  PN-codes 


A  code  sequence  generator,  that  produces  pseudonoise  (PN)-codes,  is  made  up  of  delay  elements 
with  appropriate  feedback.  By  means  of  combining  feedback  paths  with  the  delay  elements,  the 
generator’s  output  can  be  controlled  so  as  to  produce  an  output  sequence  with  certain  properties. 
A  linear  binary  shift  register  sequence  generator  using  feedback  and  modulo-2  addition  is  used  to 
generate  binary  PN-codes. 

The  pseudo-random  generator  is  so  called  because  of  its  repeating  output  sequence  and  the 
generation  of  what  appears  to  be  a  random  binary  output.  Every  sequence  consists  of  a  certain 
number  of  bits,  called  “chips”  before  repetition  sets  in.  The  larger  the  number  of  delays  used,  the 
longer  the  period  of  its  repeating  output  sequence.  The  number  and  placement  of  feedback  taps 
are  determined  by  the  generator  function.  A  diagram  of  a  PN-code  generator  shows  that  it 
consists  of  nothing  more  than  a  shift  register  in  conjunction  with  its  appropriate  logic,  which  feeds 
back  a  logical  combination  of  the  state  of  two  or  more  of  its  delay  elements  to  its  input  as  shown 
in  Figure  6. 


| - Modulo  P 

J— 
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Fig.  6  PN-generator. 

In  Figure  6,  n  is  the  last  delay  element,  normally  also  being  the  output  of  the  generator,  although 
the  output  of  any  delay  element  in  the  generator  can  be  used  to  provide  the  same  linear  code  with 
a  certain  amount  of  time  delay.  If  the  output  is  for  instance  taken  from  delay  element  (n-1)  instead 
of  n,  the  time  delay  or  shift  will  be  exactly  one  chip  duration.  The  “modulo-  P”  block  represents  a 
network  that  modulo-P  adds  outputs  of  different  delay  elements  together,  P  being  the  total 
number  of  delay  elements  added  together.  The  output  of  the  “modulo-P”  block  is  then  fed  back  to 
the  input,  which  is  considered  to  be  delay  element  number  one. 

The  generator  will  do  nothing  without  a  certain  initial  value  in  its  delay  elements.  If  any  number  of 
zeros  are  modulo-P  added  together,  the  result  will  consist  of  only  zeros.  Therefore  often  an  initial 
value  of  “one”  is  given  to  at  least  one  delay  element  and  the  first  n-bits  at  the  output  will  therefore 
show  at  most  ‘n’-ones,  as  can  be  seen  in  Figure  7  for  a  generator  made  up  of  3  stages. 

The  generator  can  produce  code  sequences  of  a  certain  maximum  period  length  N,  where  N  is 
specified  by  the  following  formula: 

N  =  2"  -  1  Eq.  3 

That  is,  N  is  the  number  of  chips  in  one  period  of  a  maximum  length  pseudo-random  bit  sequence 
and  n  is  the  number  of  delay  elements  in  the  feedback  shift  register  system. 
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The  statistical  distribution  of  the  ones  and  zeros  in  one  period  is  well  defined  and  is  always  the 
same  in  each  sequence.  Relative  positions  of  the  sequences  can  vary  from  sequence  to  sequence, 
but  each  repetition  represents  the  same  one-zero  sequence,  and  the  statistical  distribution  remains 
the  same.  To  show  that  each  generator  repeats  sequences  after  a  certain  time,  the  following 
example  is  shown.  A  generator  consisting  of  3  delay  elements,  in  which  the  outputs  of  element  1 
and  3  are  fed  back  to  the  input  of  the  first  element  is  shown  in  Figure  7. 


cod*  output 


Fig.  7  A  three  stage  PN-generator 


The  output  can  be  seen  being  repeated  every  7  chips.  With  the  initial  values  of  the  delay  elements 
being  all  ones,  the  sequence  shown  below  can  now  be  developed.  Note  that  the  values  in  columns 
1  and  3  are  modulo-2  added  and  deliver  the  new  input  value  to  delay  element  one.  After  exactly  7 
shifts  the  sequence  starts  repeating  itself.  Also  note  that  the  chips  in  the  third  column  form  the 
actual  output  sequence,  namely  the  PN-code. 


As  can  be  seen,  the  code  with  an  initial  value  of  ones  in  the  delay  elements,  is  the  repeated 
sequence  1110  10  0. 


Delay  element : 

Initial  value : 

- > 


(1  exor  1  gives  0) 


1  2  3 

1  1  1  <--- 
0  1  1 
1  0  1 
0  1  0 
0  0  1 
1  0  0 
1  1  0 

1  1  1  < - >  same  as  first  initial  value. 


A  4  stage  PN-generator  produces  a  maximum  code  length  of  15  chips,  given  by 
100110101111000 


Each  user  can  be  assigned  its  own  unique  code  by  shifting  the  PN-sequence  in  time.  When  the 
code  above  is  shifted  in  time  for  other  users  it  will  appear  as  shown. 

001101011110001  user  2 
011010111100010  user  3 
110101111000100  user  4  etc. 
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A  problem  arises  in  practice  because  when  the  information  arrives  at  the  receiver  it  is  delayed  in 
time  as  a  result  of  finite  propagation  velocity.  Without  an  absolute  time  reference,  this  means  the 
receiver  can  not  determine  which  user  has  sent  the  information. 

To  make  every  code  unique  for  each  user  the  PN-code  is  inverted  and  placed  next  to  the  original 
code,  so  a  code  of  30  chips  is  created  that  for  the  example  case  looks  like  this 

100110101111000  original  15  chips-code user  1 

011001010000111  inverted  15  chips-code  user  1 

100110101111000011001010000111  30  chips-code  user  1 

The  codes  generated  by  this  method  will  henceforth  be  called  modified  PN-sequences.  The 
original  code  can  be  shifted  in  time  15  times  which  means  15  unique  codes  can  be  produced.  And 
so  with  a  30  chip  code,  15  users  can  be  identified  without  an  absolute  time  reference. 


3.2  Walsh  codes 


Another  code  that  has  been  considered  in  this  investigation  is  the  Walsh  code.  The  discrete  Walsh 
Hadamard  transform  involves  rearrangements  of  discrete  Hadamard  matrices.  A  Hadamard  matrix 
of  order  N  is  defined  as  an  N  *  N  matrix,  which  for  dimensions  that  are  a  power  of  two,  can  be 
constructed  in  the  following  manner 


H. 


2N 


Hn 

Hn 

Hn 

-hn 

Eq.4 


with  Hj  =  [1],  Therefore, 


Hx  H, 

1  1 

h2  = 

= 

■  # i  i  • 

1  -1  . 

1111 
1-1  1-1 
1  1-1-1 
1-1-1  1 


and  for  N  =  2 


Ha  = 


h2  h2 
h2  -h2 


Eq.  5 


Eq.  6 
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For  N  =  4 


H. 


h4  -h4 


1111 
1-1  1-1 
1  1-1-1 
1-1-1  1 
1111 
1-1  1-1 
1  1-1-1 
1-1-1  1 


1111 
1-1  1-1 
1  1-1-1 
1-1-1  1 
-1  -1  -1  -1 

-1  1-1  1 

-1-1  1  1 

-111-1 


Eq.  7 


The  discrete  Walsh  Hadamard  transform  matrix  H  can  be  obtained  from  the  Hadamard  matrix  by 
reordering  the  rows  in  increasing  sequence  order.  The  sequency  of  a  row  is  half  the  number  of 
sign  changes  in  that  row.  In  Hg  the  first  row  has  no  sign  changes,  the  second  row  has  7  sign 
changes,  the  third  row  has  3  sign  changes,  and  so  on.  Reordering  the  Hg  matrix  in  increasing 
sequency  order,  results  in 


H 


1111 
1111 
1  1-1-1 
1  1-1-1 
1-1-1  1 
1-1-1  1 
1-1  1-1 
1-1  1-1 


1111 
-1  -1  -1  -1 

-1-1  1  1 

1  1-1-1 
1-1-1  1 
-111-1 
-1  1-1  1 

1-1  1-1 


Eq.  8 


This  matrix  shows  the  Walsh  codes  in  the  correct  order.  Therefore,  row  number  one  of  H  is 
Walsh  code  one,  row  number  two  is  Walsh  code  two  and  so  on.  The  first  Walsh  code  can  not  be 
used  in  this  investigation  because  it  contains  no  sign  changes.  Furthermore  Walsh  code  3,  5  and  7 
are  not  used  because  these  codes  are  shifted  versions  of  respectively  Walsh  code  2,  4  and  6.  So, 
when  a  matrix  of  8  chips  per  code  is  created  there  are  4  useful  codes  in  it  for  this  investigation. 
The  number  of  useful  codes  that  can  be  generated  is  half  the  dimension  of  the  H  matrix.  The 
dimension  of  the  H  matrix  used  for  the  system  simulation  is  32  by  32. 
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-  4.  CODE  CROSS-CORRELATION  - 


The  correlation  properties  of  code  sequences  depend  on  code  type,  length,  chip  rate,  and  even  the 
chip-by-chip  structure  of  the  particular  code  being  used.  Cross-correlation  in  communication 
systems  is  of  interest  in  several  areas,  such  as  (a)  code  division  multiple  access  systems  (or  any 
other  code  addressed  system)  in  which  receiver  response  to  any  signal  other  than  the  proper 
addressing  sequence  is  not  allowable,  and  (b)  antijamming  systems  that  may  employ  codes  with 
extremely  low  cross-correlation. 

Cross-correlation  is  a  measure  of  similarity  between  two  different  code  sequences.  It  is  a 
comparison  of  a  sequence  a„  and  another  sequence  bn  over  all  possible  time  shifts  of  one  with 
respect  to  the  other.  It  is  evaluated  from  the  formula 

c,  =  S  a  b  Eq.  9 

(m)  Yi  n  n-m  n 


Practically,  the  correlation  c(m)  can  be  obtained  by  counting  agreements  and  disagreements.  That 
is,  the  sequences  aligned  in  time  are  compared  with  each  other.  When  two  chips  being  compared 
are  one  or  are  zero,  it  is  called  an  agreement.  When  the  chips  are  different,  it  is  called  a 
disagreement.  The  greater  the  number  of  disagreements,  the  lower  the  cross-correlation  is 
between  the  sequences. 

The  cross-correlation  between  Walsh  codes  is  better  then  the  cross-correlation  between  modified 
PN-sequences  (modified  PN-sequences  are  explained  in  subsection  3.1).  The  modified 
PN-sequences  shown  in  Table  1  have  been  used  in  previous  work  as  reported  on  in  [4], 


Table  1  Modified  PN-sequences  for  7  users 


Modified  PN-sequences 


1 


1001  1010  1111  000  0110  0101  0000  111 


0100  1101  0111  100  1011  0010  1000  011 


0010  0110  1011  110  1101  1001  0100  001 


0001  0011  0101  111  1110  1100  1010  000 


1000  1001  1010  111  0111  0110  0101  000 


1100  0100  1101  011  0011  1011  0010  100 


1110  0011  0110  101  00  01  1101  1001  010 
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The  cross-correlation  between  sequence  3  and  sequence  4  (one  chip  shifted  in  time),  for  example, 
gives  the  result  shown  in  Table  2. 

Table  2  Cross-correlation  result  between  two  PN-sequences 


3 

00  10 

0  110 

10  11 

1  1  0 

110  1 

100  1 

0  100 

00  1 

fl 

00  10 

0  110 

10  11 

1  1  1 

110  1 

100  1 

0  100 

000 

CR 

AAAA 

AAAA 

AAAA  AAD 

AAAA 

AAAA 

AAAA 

AAD 

A  =  Agreement 
D  =  Disagreement 
CR  =  Correlation  Result 

The  number  of  chips  in  agreement  is  28  out  of  30.  The  more  chips  that  agree,  the  higher  the 
chance  of  cross-talk.  This  is  an  effect  that  needs  to  be  avoided  as  much  as  possible.  In  this  case 
there  is  always  a  pair  of  sequences  that  has  this  cross-correlation  level  thereby  increasing  the 
chance  of  cross-talk  between  those  two  signals  at  the  receiver. 

In  the  simulations  carried  out  all  the  useful  Walsh  codes  are  implemented  as  user  code  masks  (See 
Table  3).  Note  that  in  the  Tables  3  and  4  the  negative  ones  in  the  H  matrix  become  zero  in  the 
binary  Walsh  code. 


Table  3  Walsh  codes  for  16  users 


Walsh  codes 

1 

1111 

1111 

1111 

1111 

0000 

0000 

0000 

0  0  00 

2 

1111 

1111 

0000 

00  00 

1111 

1111 

0000 

0000 

1111 

0000 

1111 

0  000 

0000 

1111 

0000 

1111 

1111 

0000 

1111 

0000 

1111 

0000 

1111 

0000 

5 

1100 

00  11 

1100 

00  11 

00  11 

1100 

00  11 

1100 

1100 

00  11 

1100 

00  11 

1100 

00  11 

1100 

00  11 

D 

1100 

1100 

1100 

1100 

00  11 

00  11 

00  11 

00  11 

8 

1100 

1100 

1100 

1100 

1100 

1100 

1100 

1100 

9 

100  1 

100  1 

100  1 

100  1 

0  110 

0  110 

0  110 

0  110 
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Walsh  codes 


10 


1001  1001  0110  0110  1001  1001  0110  0110 


11 


1001  0110  1001  0110  0110  1001  0110  1001 


12 


1010  0101  1010  0101  1010  0101  1010  0101 


13 


1010  0101  1010  0101  0101  1010  0101  1010 


14 


1  0  1  0  1  0  1  0  0  1  0  1  0  1  0  1  1  0  1  0  1  0  1  0  0  1  0  1  0  1  0  1 


15 


1010  1010  1010  1010  0101  0101  0101  0101 


16 


1010  1010  1010  1010  1010  1  0  10  10  10  10  10 


When,  for  example,  Walsh  code  3  and  Walsh  code  4  (1  chip  shifted  in  time)  are  compared,  the 
number  of  agreements  is  16  out  of  32  (Table  4). 


Table  4  Cross-correlation  result  between  two  Walsh  codes 


1  3 

1111 

0000 

1111 

0000 

0000 

1111 

0000 

1  1  1  1 

n 

1110 

000  1 

1110 

000  1 

1110 

000  1 

1110 

000  1 

CR 

AAAD 

AAAD 

AAAD 

AAAD 

DDDA 

DDDA 

DDDA 

DDDA 

It  turns  out  that  regardless  of  which  Walsh  codes  are  cross-correlated,  the  number  of 
disagreements  between  them  will  always  be  16.  While  this  can  be  proven,  it  can  be  said  that  the 
cross-correlation  between  the  Walsh  codes  is  better  than  the  cross-correlation  between  the 
modified  PN-sequences,  because  of  the  number  of  agreements  that  result.  As  previously 
explained,  the  lower  the  number  of  agreements  between  the  sequences,  the  less  chance  there  is  of 
cross-talk  amongst  the  users.  In  subsection  7.4,  simulation  results  that  address  this  issue  can  be 
found. 
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-  5.  UNIPOLAR  AND  BIPOLAR  O-CDMA  SYSTEMS  - 


The  codes  in  Section  4  are  used  in  a  O-CDMA  bipolar  system.  In  the  following  two  subsections, 
both,  systems  using  unipolar  and  bipolar  codes  are  described  and  the  advantages  of  one  over  the 
other  are  explained. 


5.1  O-CDMA  system  using  unipolar  codes 

The  optical  unipolar  CDMA  system  can  be  described  as  follows.  Let  d^  represent  a  (0,  1)  binary 
bit  sequence  at  a  rate  R  bits/sec  associated  with  the  ith  user  of  an  N  user  system.  Such  user  is 
assigned  a  spreading  code  or  mask  Uj,  where  u;  consists  of  K  logic  states  0  and  1.  A  logic  1  in  U; 
implies  light  transmission  while  a  logic  0  implies  no  (or  very  low)  light  transmission.  A  very 
broadband  light  source  is  employed  from  which  K  distinct  frequencies  are  extracted  and  each 
assigned  to  one  of  the  logic  states  associated  with  the  mask.  When  d^  equals  1,  those  frequencies 
with  a  logic  1  state  in  Uj  are  transmitted  (light  is  “on”)  while  frequencies  associated  with  a  logic  0 
state  in  Uj  are  not  transmitted  at  all  (or  transmitted  at  a  very  low  level  perhaps  as  much  as  30  dB 
below  the  “on”  signal  level).  When  d„;  equals  0,  those  frequencies  associated  with  a  logic  state  1 
in  Uj  are  transmitted  at  a  level  low  enough  to  represent  leakage  light  (thus  modeling  the  inability  to 
completely  turn  off  the  light  source),  while  those  frequencies  associated  with  a  logic  state  0  in  U; 
are  not  transmitted  at  all  or  at  a  very  low  level,  as  previously  explained.  This  effectively  produces 
on-off  keyed  modulation  signals  at  each  of  the  transmitted  frequencies  of  the  broadband  source, 
which  are  summed  to  produce  a  composite  signal  that  is  equivalent  to  a  wavelength  multiplexed 
digital  signal.  At  the  receiver,  the  different  frequency  components  present  in  the  composite  signal 
are  separated,  identified  and  extracted.  Photodetectors  are  used  to  produce  a  photon  count  that  is 
proportional  to  the  light  intensity  present  at  each  frequency.  When  the  overall  photon  count 
associated  with  all  frequencies  that  can  be  transmitted  for  a  given  mask  exceeds  a  specified 
threshold,  a  logic  1  is  associated  with  d^  whereas  if  the  threshold  is  not  exceeded,  a  logic  0  is 
declared. 

Simulations  of  this  O-CDMA  system  using  unipolar  codes  were  developed  and  tested  as  part  of  a 
previous  investigation.  While  the  simulation  results  pointed  toward  a  system  that  worked 
properly,  it  was  found  that  key  to  its  operation  was  the  ability  to  correctly  set  the  receiver 
threshold  for  the  binary  decisions.  More  importantly,  the  correct  threshold  setting,  proved  to  be 
very  much  dependent  on  the  number  of  users  present.  It  was  felt  that  unless  adequate  prior 
knowledge  existed  about  how  many  users  were  simultaneously  active,  it  would  be  difficult  if  not 
impossible  to  have  such  a  system  operate  properly,  unless  very  sparse  user  codes  (or  masks)  were 
used  that  produced  little  or  no  interference  amongst  users.  This,  although  possible,  would  require 
very  long  user  codes  Uj  (because  each  mask  would  be  mostly  made  up  of  logical  0  states)  and 
therefore  a  very  broadband  source,  or  shorter  user  codes  accommodating  a  small  number  of  users 
[4],  These  limitations  makes  the  choice  between  a  system  using  unipolar  codes  and  one  using 
bipolar  codes  an  easy  one. 
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5.2  O-CDMA  system  using  bipolar  codes 

The  optical  bipolar  CDMA  system  can  be  described  as  follows.  As  in  subsection  5.1,  let  d„j 
represent  a  (0,  1)  binary  bit  sequence  at  a  rate  R  bits/sec  associated  with  the  ith  user  of  a  N  user 
system.  Such  user  is  assigned  a  spreading  code  or  mask  ui5  where  u;  consists  of  K  logic  states  0 
and  1.  Associated  with  code  U;  is  the  complementary  code  %  where  each  element  of  the  latter  is 
the  logical  complement  of  the  former.  A  concatenation  of  U;  and  Uj  is  formed  to  produce  a  code 
(or  mask)  of  2K  logical  states,  denoted  U;.  As  before,  a  logic  1  (in  Uj)  implies  light  transmission 
while  a  logic  0  implies  no  (or  very  low)  light  transmission.  A  very  broadband  light  source  is 
employed  from  which  2K  distinct  frequencies  are  extracted  and  each  assigned  to  one  of  the  logic 
states  associated  with  the  mask.  When  d„i  equals  1,  those  frequencies  associated  with  a  logic  1 
state  in  Ufare  transmitted  (light  is  “on”)  while  frequencies  associated  with  a  logic  0  state  in  Uj  are 
not  transmitted  at  all  (or  transmitted  at  a  very  low  level  perhaps  as  much  as  30  dB  below  the  “on” 
signal  level).  When  d„j  equals  0,  those  frequencies  associated  with  a  logic  state  1  in  U;  are 
transmitted  at  a  level  low  enough  to  represent  leakage  light  (thus  modeling  the  inability  to 
completely  turn  off  the  light  source),  while  those  frequencies  associated  with  a  logic  state  0  in  U; 
are  not  transmitted  at  all  or  at  a  very  low  level,  as  previously  explained.  In  addition  to  forming  a 
concatenated  code  or  mask,  the  bipolar  system  differs  from  the  unipolar  one  by  forming  the 
complement  U(  of  the  concatenated  code  U;.  This  complement  code  U;  is  operated  upon  by  <3^,  the 
complement  of  the  binary  sequence  associated  with  the  ith  user.  The  same  2K  frequencies 
extracted  from  the  broadband  source  are  assigned  to  each  one  of  the  logic  states  associated  with 
the  mask.  When  equals  1,  those  frequencies  associated  with  a  logic  1  state  in  U;  are  transmitted 
(light  is  “on”)  while  frequencies  associated  with  a  logic  0  state  in  U;  are  not  transmitted  at  all  (or 
transmitted  at  a  very  low  level  perhaps  as  much  as  30  dB  below  the  “on”  signal  level).  When 
equals  0,  those  frequencies  associated  with  a  logic  state  1  in  Uj  are  transmitted  at  a  level  low 
enough  to  represent  leakage  light  (thus  modeling  the  inability  to  completely  turn  off  the  light 
source),  while  those  frequencies  associated  with  a  logic  state  0  in  U;  are  not  transmitted  at  all  or 
at  a  very  low  level,  as  previously  explained.  This  effectively  produced  on-off  keyed  modulation  at 
each  of  the  transmitted  frequencies  of  the  broadband  source  which  are  summed  to  produce  a 
composite  signal  that  is  equivalent  to  a  wavelength  multiplexed  digital  signal.  At  the  receiver  the 
different  frequency  components  present  in  the  composite  signal  are  separated,  identified,  and 
extracted.  Photodetectors  are  used  to  produce  a  photon  count  that  is  proportional  to  the  light 
intensity  present.  The  overall  photon  count  associated  with  all  the  frequencies  that  can  be 
transmitted  for  the  U;  mask  is  compared  to  the  overall  photon  count  associated  with  all 
frequencies  that  can  be  transmitted  for  the  U;  mask.  If  the  former  exceeds  the  latter,  a  logic  1  is 
associated  with  d^  whereas  if  the  latter  exceeds  the  former,  a  logic  0  is  declared.  This  is 
equivalent  to  subtracting  the  two  values  of  the  photon  count  and  comparing  that  number  with 
zero.  The  masks  U;  were  generated  using,  both,  modified  PN-sequences  and  Walsh  codes. 

The  clear  advantage  of  this  method  versus  the  (unipolar)  previous  one  is  that  the  threshold  used 
to  decide  on  the  logical  state  associated  with  d^  is  independent  of  the  number  of  users  present. 
This  important  feature  of  the  bipolar  method  leads  to  a  system  that  is  both  implementable  and 
attractive  from  a  performance  standpoint  [4], 
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-  6.  THE  SIMULATION  SYSTEM  BLOCKS  - 


In  this  section,  the  development  of  the  computer  simulation  system  will  be  discussed  in  detail.  As 
previously  indicated,  a  software  simulation  toolkit  called  SPW  was  used  to  perform  computer 
simulations  of  the  O-CDMA  digital  communication  system.  Using  SPW,  the  communication 
system  operating  under  many  different  system  parameter  settings,  can  be  thoroughly  evaluated. 
An  advantage  of  SPW  is  that  it  is  block  oriented.  The  complete  system  can  be  subdivided  into 
subcomponents  or  blocks,  each  performing  one  particular  function. 

Some  of  the  simulation  run  times  were  found  to  be  lengthy,  especially  as  the  number  of  users  N, 
increased.  A  system  constructed  with  16  users  was  found  to  be  the  largest  one  whose 
performance  could  be  simulated  in  a  reasonable  amount  of  time.  Additionally,  the  amount  of  data 
being  processed  was  so  extensive  that  the  computer  system  used,  though  quite  modem,  simply 
could  not  handle  simulations  involving  greater  number  of  users.  With  16  users,  data-swapfiles  of 
sizes  bigger  than  220  Megabytes(!)  were  created  by  SPW  and  the  hard  disk  was  used  as 
temporary  storage  for  data  being  processed  by  the  software.  The  RAM  memory  (32  Megabytes) 
in  the  Sparc  workstations  was  not  sufficient  for  some  simulation  runs.  Expanding  the  simulation 
system  to  32  users  was  not  possible,  forcing  all  simulations  therefore  to  be  carried  out  for  a 
system  of  16  users.  In  order  to  explain  the  different  components  of  the  simulation  system,  only  4 
users  are  assumed  to  be  present  in  the  descriptions  that  follow.  The  number  of  users  in  the  O- 
CDMA  system  can  be  set  to  any  desired  number,  but  greater  computational  requirements  will 
result  when  carrying  out  simulations. 

Both  the  synchronous  and  the  asynchronous  O-CDMA  system  will  be  discussed  in  this  section, 
highlighting  the  differences  between  the  two. 


6.1  The  overall  system 

In  Figure  8,  major  components  of  the  overall  simulation  system  are  shown.  From  this  figure,  it  can 
be  seen  that  the  O-CDMA  system  roughly  consists  of  three  main  sections.  First,  there  is  the 
transmitting  section.  The  second  section  consists  of  the  receiver  and  the  third  section  represents 
the  transmission  medium,  namely  the  fiber  optic  network,  connecting  the  first  two  sections  to 
each  other.  In  the  transmitting  section,  blocks  labeled  “PN-generator”  and  blocks  labeled 
“encoder”  can  be  discerned.  The  PN-generator  block  provides  the  pseudo  random  binary  data  for 
each  individual  user.  The  encoder  block  is  used  to  convert  the  data  into  related  optical  sequences. 
The  encoder  blocks  are  user-unique,  which  means  that  each  encoder  assigns  an  individual 
signature  to  the  data  (in  the  form  of  a  code  mask)  of  the  corresponding  connected  user.  The 
output  of  all  individual  user  encoder  blocks  are  added  together,  representing  the  fiber  optic 
channel’s  input. 
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Fig.  8  The  simulation  system  with  four  users. 

The  receiver  section  consists  of  a  “pre-decoder”-  block  (“pre-”  is  short  for  preliminary)  followed 
by  the  four  user  unique  decoder  blocks.  The  pre-decoder  block  processes  the  channel  output  by 
separating  the  frequency  components  via  a  bank  of  bandpass  filters.  The  decoder  blocks  perform  a 
specific  signal  processing  operation,  that  is  unique  to  each  user. 

The  synchronous  system  operates  on  the  same  bit  rate  per  user  driven  by  a  master  clock,  while  in 
the  asynchronous  system  users  operate  on  different  bit  rates  with  unrelated  clocks.  Therefore, 
computer  simulations  of  the  asynchronous  system  differ  from  the  synchronous  system  in  the 
number  of  pre-decoder  blocks  present.  In  the  synchronous  system,  the  received  signal  is  applied 
directly  to  the  pre-decoder  block,  whose  output  feeds  a  separate  decoder  block,  one  for  every 
user.  The  asynchronous  system  however,  has  a  pre-decoder  block  for  each  user  whose  output  is 
connected  to  individual  decoder  blocks. 
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The  sampling  frequency  is  set  at  400  MHz  and  the  data  rate  at  100  KBps.  In  the  simulation  runs, 
these  as  well  as  other  parameters  were  varied  and  their  effect  studied.  The  sampling  frequency  can 
not  be  chosen  too  large,  or  the  system’s  limits  in  handling  data  will  be  reached.  It  was  determined 
that  a  sampling  frequency  of  400  MHz  offered  a  reasonable  compromise. 

The  simulation  results  are  intended  to  offer  qualitative  and  quantitative  information  on  the 
performance  of  O-CDMA  digital  communication  systems.  These  results,  as  discussed  Section  7, 
can  be  used  as  a  starting  point  for  the  development  of  practical  realizations  of  O-CDMA  digital 
communication  systems. 

The  individual  blocks  used  in  the  overall  simulation  system  will  now  be  discussed. 


6.2  The  PN-data  generator  block 

Function:  provides  the  pseudo  random  binary  data 

Parameters: 

-  sampling  frequency 

-  data  rate 

-  PN-formula  specification 
Description: 

The  transmitted  data  in  the  O-CDMA  digital  communication  system  is  of  random  binary  form.  At 
the  lower  level  of  this  block,  as  seen  in  Figure  9,  a  pseudo-random  generator  block  is  used  to 
simulate  the  random  data.  In  Section  3,  it  was  indicated  that  PN-code  sequences  have  periods  of 
variable  length.  By  setting  the  period  length  to  15  chips  and  assigning  every  user  a  shifted  version 
of  this  PN-sequence,  specific  data  bit  streams  are  assigned  to  each  of  the  individual  users.  (Note 
that  only  four  shifted  versions  of  the  15  chip  length  PN-sequence  are  needed  for  the  4  user  system 
and  that  the  16th  user  in  a  16  user  system  would  have  to  be  assigned  the  same  version  as  user  1). 
This  avoids  the  need  to  provide  signal  observation  (i.e.,  signal  sinks)  for  both  the  transmitted  data 
and  the  recovered  data  of  all  of  the  individual  users.  This  has  the  effect  of  reducing  simulation  run 
times  considerably. 

The  PN-data  generator  is  set  to  work  at  a  certain  data  rate  by  the  use  of  the  impulse  train  block 
and  the  inverter  block,  which  control  the  hold  input  of  the  PN-generator  block.  At  the  first 
simulation  iteration,  the  first  bit  of  the  PN-sequence  is  produced  and  held  for  a  certain  amount  of 
time  determined  by  the  value  of  the  parameter  “data  rate”  as  set  in  the  impulse  train  block. 
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Fig.  9  Lower  level  diagram  of  the  PN-code  generator  block. 

Figure  10  shows  the  data  signals  for  all  users  in  a  4  user  O-CDMA  digital  communication  system. 
Four  shifted  versions  of  a  PN-code  having  a  period  length  of  7  chips  are  shown. 


Tui  r  . . : . n  ( 

“ 1  , 

,J - 1 . 

u2  p— 

1  1 

i  ~~l  1  1  -j _ i — i . 

u3  i  i  i — i  , 

,  r-; 

n  j 

~  ■  r  ■  ,  .1 . 

u4  i 

i  t 

i 

i  .  r  ,  ,  l — . 

* - 

- ^ 

0 

T 

- > 

Fig.  10  User  data  for  four  users. 


6.3  The  encoder  block 
Function: 

The  encoder  block  is  used  to  convert  the  binary  random  data  into  (the  equivalent  of)  optical 
sequences. 
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Parameters: 

-  sampling  frequency 

-  data  rate 

-  power  level  for  transmitting  light 

-  power  level  for  transmitting  no  light 

-  power  level  for  light  leakage 

Description: 

Figure  1 1  shows  at  the  lower  level  the  input  and  output  of  the  encoder  block.  The  input  to  the 
encoder  block  is  the  output  of  the  PN-data  generator  block.  The  logical  states  in  the  binary 
random  data  input  signal  are  converted  into  integer  voltage  levels  in  the  “Binary  to  Numeric” 
block.  Figure  12  shows  at  the  lower  level  the  major  components  of  the  encoder  block.  Complex 
tone  generator  blocks  are  used  to  create  sinusoids  of  prescribed  frequencies.  The  complex  tone 
block  generates  a  pure  sine  wave  of  a  certain  frequency,  after  which  the  real  component  is 
extracted  and  used  to  produce  on-off  keyed  modulated  signals.  The  pseudo-random  data  is  used 
to  control  the  operation  of  switches  that  allow  these  tones  to  be  multiplied  either  by  a  unity  factor 
corresponding  to  a  logic  “1”  in  the  mask,  or  a  very  small  factor,  corresponding  to  a  logical  “0”  in  ’ 
the  mask.  (See  also  Section  5  in  this  report.) 

Unique  codes  selected  from  the  family  Walsh  codes  provide  an  individual  signature  for  each  user, 
e  binary  mask  is  represented  by  the  “onsig”  and  “lowsig”  inputs,  their  values  set  at  1  and  0  01 

respectively.  It  can  be  seen  from  Figure  12  that  the  system  implements  a  bipolar  code  to  encode  ’ 
the  user  information. 
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Fig.  11  Lower  level  diagram  of  the  encoder  block  (input  and  output). 
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Fig.  12  Lower  level  diagram  of  the  encoder  block  (major  components). 

6.4.  The  preliminary  decoder  block 
Function: 

In  the  pre-decoder  block  the  frequency  components  of  the  channel  output  are  separated  by  a  bank 
of  Butterworth  bandpass  filters. 

Parameters: 

-  sampling  frequency 

-  data  rate 

-  photo  diode  efficiency 

-  filter  bandwidth 
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Description: 

Figure  13  shows  at  the  lower  level  the  major  components  of  the  pre-decoder  block.  Each  filter  in 
the  pre-decoder  block  of  the  4  user  O-CDMA  system  has  a  center  frequency  of  2m  (in  MHz)  in 
which  m  =  1,  2, ,  8.  The  bandwidth  of  the  filters  is  made  large  enough  to  capture  the  energy 
associated  with  each  on-off  keyed  modulation  signal.  The  signal  at  the  output  of  the  filter  is 
squared  and  integrated  over  the  bit  duration  interval.  This  output  is  a  measure  of  signal  energy  at 
each  of  the  2m  (in  MHz)  frequencies. 

The  block  “impulse  train”  has  two  parameters  that  need  to  be  set.  One  is  the  impulse  rate  or  data 
rate  and  the  other  is  the  sampling  frequency.  The  output  of  the  block  will  give  a  pulse  after  a 
certain  number  of  samples  given  by 


f sample  _  ^ 

d  ~ 

In  Eq.  9,  fsampU  represents  the  sampling  frequency  (in  samples/sec)  and  d  the  data  rate  (in 
Bits/sec).  N  in  this  case,  represents  the  number  of  samples  contained  in  each  data  bit  interval.  In 
the  common  case  where  fsampU  =  400  Mhz  and  d=  100  kHz  then  N  equals  4000  samples/bit. 

The  delay  block  delays  the  signal  for  another  2500  samples  in  order  to  represent  the  delay  in  time 
between  sending  the  information  and  receiving  it.  The  pulse  called  “s_hold”  (sample  and  hold)  is 
high  at  sample  values  2500,  6500,  10500,  14500  etc.  This  impulse  train  is  connected  to  the  block 
“integrate  &  dump”  and  to  the  “switch”  block.  Every  time  the  pulse  is  high  the  “integrate  & 
dump”  block  integrates  the  received  signal  over  its  bit  duration,  stores  it  at  the  output,  and  is  reset 
to  zero. 

Each  “switch”  block  produces  a  constant  output  set  by  the  parameter,  which  has  been  set  to 
0.00000000001  in  this  case.  Only  when  the  “s_hold”  pulse  is  high,  a  switch  to  the  second  input 
takes  place  and  allows  the  value  calculated  by  the  “integrate  &  dump”  block  through.  Two  more 
blocks  are  added  to  model  the  photon  count.  Light  exists  as  photons  that  impinge  on  a 
photodetector.  The  detector  counts  the  number  of  photons  in  the  light  signal,  but  cannot  do  so 
exactly.  To  represent  this  non  ideal  nature  of  a  photodetector,  the  Poisson  block  is  added.  The 
Poisson  block  produces  a  Poisson  distributed  random  number,  that  is  proportional  to  the  input  of 
the  block.  The  output  of  this  block,  for  each  of  the  2m  (in  MHz)  frequencies,  represents  the 
energy  of  the  signal  at  that  frequency.  The  scalar  gain  block  in  front  of  the  Poisson  block  is  placed 
to  represent  the  efficiency  of  the  energy  conversion  process. 


6.5.  The  decoder  block 
Function: 

Final  decoding  of  the  received  lightwave  signals. 
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Parameters: 

-  sampling  frequency 

-  data  rate 

Description: 

Figure  14  shows  at  the  lower  level  the  major  components  of  the  decoder.  The  signals  represented 
by  the  ones  in  the  U;  code  mask  (see  Section  5.2),  are  added  together.  Also  the  signals 
represented  by  the  ones  in  the  complementary  code  mask,  0;,  are  added  together.  After 
subtracting  these  two  values,  the  result  is  compared  to  zero.  The  comparison  is  made  by  the 
“threshold”  block.  The  parameters  “pos  thresh”  and  “neg  thresh”  are  set  in  such  a  way  that  if  the 
value  at  the  input  is  higher  than  the  value  “pos  thresh”  a  1  is  declared  at  the  output,  and  when  it  is 
lower  than  the  “neg  thresh”  value,  a  -1  is  declared  at  the  output.  As  long  as  the  value  at  the  input 
is  between  “pos  thresh”  and  “neg  thresh”,  the  output  remains  unchanged.  Both  threshold 
parameters  are  set  to  zero  in  this  case,  which  means  that  the  output  will  be  1  when  the  input  value 
is  above  zero  and  the  output  will  be  -1  when  the  input  value  is  below  zero. 

The  “and”  block,  placed  after  the  “threshold”  block,  has  two  inputs.  The  first  input  is  the  output 
of  the  “threshold”  block  and  the  second  input  is  the  “s_hold”  pulse  described  in  Section  6.4.  The 
output  of  an  “and”  block  is  only  1  when  both  inputs  are  1.  In  all  other  cases  the  output  is  0.  The 
last  block  in  the  schematic  is  a  “latch”  block.  Its  inputs  are  s  (set)  and  r  (reset).  When  s  receives  a 
1  from  the  “and”  output  it  will  declare  a  1  at  the  Q  output.  The  output  remains  1  until  the  r  input 
becomes  a  1,  causing  the  Q  output  to  become  0.  The  output  will  become  1  again  when  the  s  input 
is  a  1,  and  so  on.  The  output  of  this  block  is  the  decoded  binary  data  signal. 


Fig.  14  Lower  level  diagram  of  the  decoder  block. 


6.6.  The  transmission  medium 


The  fiber  optic  transmission  medium  shown  in  Figure  8  is  considered  to  be  ideal  in  the  system 
under  study.  In  previous  work  the  channel  signal  was  allowed  to  propagate  through  a  soft  channel 
nonlinearity  which  for  simulation  purposes  had  been  allowed  to  take  the  form  y  =  Atanh(Bx), 
where  x  and  y  are  the  input  and  output,  respectively,  and  A  as  well  as  B  are  variable  parameters 
that  control  the  amplitude  and  the  degree  of  the  curvature  of  the  nonlinearity,  respectively.  After 
performing  specific  simulations  in  the  course  of  this  research  however,  it  was  found  that  the  effect 
of  the  nonlinear  function  of  the  transmission  block  was  negligible  for  the  frequency  components  in 
the  channel  signal  considered.  For  this  reason  and  the  assumed  short  transmission  distances  in  the 
O-CDMA  digital  communication  system  under  study,  ideal  channel  propagation  conditions  were 
simulated  and  therefore  the  transmission  block  was  not  used. 


-  7.  THE  SIMULATION  RESULTS  - 

Not  all  of  the  simulation  results  that  were  obtained  are  shown  in  this  report.  Only  those  results 
that  are  important  to  the  main  issues  addressed  in  this  study  are  described  and  tabulated  in  this 
section. 

All  the  simulations  are  performed  in  an  O-CDMA  digital  communication  system  for  15  or  16 
users.  For  simulations  performed  with  Walsh  codes  as  code  masks,  the  total  number  of  users  is 
16.  For  simulations  performed  with  modified  PN-sequences  as  code  masks,  the  total  number  of 
users  is  15.  As  explained  earlier,  this  number  proved  to  be  the  largest  number  of  users  that  could 
be  simulated  within  reasonable  computer  run  times.  Though  the  system  can  easily  be  expanded  to 
more  users,  this  would  require  greater  computational  capabilities. 

The  signal  labeled  “Transmitted  Data”  shows  the  data  that  has  to  be  transmitted  for  a  particular 
user.  The  signal  labeled  “Decoded  Output  User”  shows  the  decoded  output  signal  for  a  specific 
user.  The  signal  labeled  “error”  shows  the  number  of  errors  in  the  received  signal  compared  to  the 
signal  sent  by  the  user.  These  three  signals  are  shown  in  Figure  15. 

The  block  used  to  count  the  errors  in  the  received  signal,  makes  a  comparison  at  every  sample 
time  and  counts  the  differences  between  the  transmitted  data  and  the  decoded  output  signal.  The 
number  of  samples  per  bit  interval  is  different  for  simulations  with  different  data  rates.  To  get  the 
actual  number  of  errors  in  the  decoded  output  signal,  the  number  of  counted  errors  needs  to  be 
divided  by  the  number  of  samples  in  the  bit  interval.  The  columns  labeled  “Errors”  in  the  tables 
show  the  total  number  of  errors  for  the  total  number  of  users. 
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Each  simulation  has  been  performed  for  100,000  sample  points.  To  make  a  fair  comparison 
between  the  results,  a  column  “Average  error  rate”  is  added.  It  gives  the  number  of  errors  per 
user,  which  means  that  the  number  of  “Total  errors”  is  divided  by  the  product  of  the  number  of 
users  and  the  number  of  bit  intervals  observed. 


Fig .  15  Decoded  output  with  2  errors 


7.1  Varying  parameters  in  a  synchronous  system 

Table  5  shows  performance  results  associated  with  different  combinations  of  parameters  involving 
the  data  rate,  filter  bandwidth,  and  photodiode  efficiency  in  a  synchronous  system  with  Walsh 
code  masks.  The  data  rate  parameter  is  set  in  the  main  block  diagram,  the  “pre-decoder”  block, 
and  the  “decoder”  block  (impulse  train).  The  photodiode  efficiency  and  the  filter  bandwidth  are 
both  parameters  set  in  the  “pre-decoder”  block.  Parameters  involving  data  rate,  filter  bandwidth, 
and  photodiode  efficiency  are  varied  in  these  simulations. 


Combination  of  parameters  that  produced  errorless  results 

The  data  sent  is  received  without  errors  in  a  situation  where  the  sampling  frequency  is  400  MHz, 
the  frequency  components  of  the  masks  are  2m  (in  MHz)  with  m  =  1,  2,  3,  32,  the  data  rate  is 

100  KHz,  the  filter  bandwidth  is  500  KHz,  and  photodiode  efficiency  is  100,000.  The  results  of 
this  simulation  are  considered  the  reference  set.  One  parameter  is  changed  at  the  time  and  the 
results  of  those  simulations  are  compared  to  the  results  of  this  “reference”  simulation. 


Varied  parameter:  data  rate 

The  data  rate  is  tested  at  lower  and  higher  values.  The  higher  the  value  of  the  data  rate  becomes, 
the  more  errors  that  show  up  at  the  receiver.  From  the  results  of  the  simulations  it  can  be 
concluded  that  a  data  rate  higher  than  100  KHz  show  errors  at  the  receivers.  The  simulation  with 
a  data  rate  of  50  KHz  has  been  carried  out  to  show  that  a  lower  data  rate  will  result  in  perfect 
decoded  outputs  as  well. 
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Varied  parameter:  photodiode  efficiency 

The  simulation  results  show  that  a  photodiode  efficiency  of  less  than  100,000  is  not  enough  to  let 
the  process  run  without  any  errors.  To  allow  the  energy  conversion  process  be  efficient,  the  scalar 
gain  block  has  to  be  set  at  a  minimum  of  100,000. 


Varied  parameter:  filter  bandwidth 

The  filter  bandwidth  parameter  has  been  investigated  exhaustively.  In  all  of  the  situations 
considered,  there  are  errors  in  the  decoded  output  signal.  The  first  case  in  which  errors  were 
found,  when  the  bandwidth  is  smaller  than  the  reference  case,  is  at  a  bandwidth  of  400  KHz.  The 
bandwidth  of  the  filters  is  not  large  enough  to  capture  the  energy  associated  with  each  on-off 
keyed  modulation  signal.  In  a  case  of  a  broader  bandwidth,  the  errors  first  show  up  at  the 
receivers  when  the  filter  bandwidth  is  set  to  600  MHz.  The  errors  are  now  caused  by  the  fact  that 
the  filters  pick  up  signal  energy  from  adjacent  channel  users. 


Table  5  Results  of  varying  several  parameters  in  a  synchronous  system 


Varied  parameter:  Signal  to  Noise  Ratio 

White  Gaussian  noise  was  introduced  in  the  complex  tone  generators  in  the  encoder-blocks  in 
order  to  get  an  indication  of  the  influence  of  white  noise  on  correct  decoding  of  the  received 
signals.  Imperfect  decoding  was  found  to  be  present  for  a  Signal  to  Noise  Ratios  (SNR)  smaller 
than  6  dB.  Table  6  shows  the  amount  of  errors  present  with  the  SNR  set  at  3  dB. 


Table  6  Result  varying  Signal  to  Noise  Ratio 


Data  rate 
(Hz) 

nia 

Filter 

bandwidth  (Hz) 

SNR 

(dB) 

Total 

errors 

Average 
error  rate 

100,000 

100,000 

500,000 

3 

4 

0.010 

Varied  parameters:  filter  bandwidth  and  transmission  frequencies 

The  frequency  components  associated  with  the  masks  are  brought  closer  together  and  are  tested 
with  values  at  lm  (in  MHz)  with  m  =  2,  3,  4,  33  (mask  a),  and  with  values  at  0.5m  (in  MHz) 

with  m  =  4,  5,  6,  ...,  35  (mask  b).  The  only  combination  of  parameters  that  shows  performance 
results,  without  errors  involves  a  data  rate  of  100  MHz,  a  filter  bandwidth  of  400  MHz,  and  a 
photodiode  efficiency  of  100,000  in  a  system  with  frequency  components  lm  (in  MHz)  with  m  = 
2,  3,  4,  ...,  33.  In  all  other  cases,  errors  are  detected  at  the  receivers.  The  frequency  components 
of  the  masks  are  set  in  the  “encoder”  block  and  in  the  “pre-decoder”  block.  The  results  of  the 
simulations  are  shown  in  Table  7. 


Table  7  Results  varying  filter  bandwidth  and  transmission  frequencies 


Mask 

Data  rate 
(Hz) 

Filter 

bandwidth  (Hz) 

Total 

errors 

Average 
error  rate 

a 

100,000 

100,000 

250,000 

158 

0.395 

a 

100,000 

100,000 

350,000 

11 

0.028 

a 

100,000 

100,000 

400,000 

0 

0 

a 

100,000 

100,000 

500,000 

1 

0.003 

b 

100,000 

100,000 

200,000 

198 

0.495 
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Varied  parameters:  data  rate  and  “low”  signal  &  “no”  signal  levels 

In  the  “encoder”  block,  the  signals  “on”,  “low”,  and  ”no”  can  be  found.  The  value  of  the  on 
signal  represents  light  transmission  (light  is  “on”).  The  value  of  the  “low”  signal  represents 
leakage  of  light,  while  the  “no”  signal  value  represents  no  light  transmission  at  all.  The  values  of 
the  “low”  signal  and  “no”  signal  are  varied.  Table  8  shows  the  effect  of  variations  in  these 
parameters  on  the  decoded  output  signals  for  different  data  rates.  The  photodiode  efficiency  and 
the  filter  bandwidth  are  set  at  respectively  100,000  and  500  KHz  for  each  simulation. 


Table  8  Results  varying  parameters  data  rate  and  “low”  signal  &  “no”  signal  level 


GESS19H 

Total  errors 

Average  error  rate 

100,000 

0.01 

B 

0 

0 

0.05 

0.005 

0 

0 

150,000 

0.01 

0.001 

3 

0.005 

150,000 

0.001 

0.0001 

1 

0.002 

0.05 

0.005 

3 

0.005 

200,000 

0.01 

0.001 

14 

0.018 

0.001 

0.0001 

11 

0.014 

0.05 

0.005 

13 

0.017 

7.2  Walsh  codes  and  modified  PN-sequences  as  code  masks 

As  explained  in  Section  4,  the  cross-correlation  between  Walsh  codes  is  better  than  the  cross¬ 
correlation  between  modified  PN-sequences.  From  the  simulation  results  it  can  be  concluded  that 
the  Walsh  codes  used  as  code  masks  give  fewer  errors  in  the  decoded  output  signal  and  therefore 
it  is  a  better  code  to  use  as  code  mask.  To  make  a  fair  comparison  between  the  two  different 
codes  an  ideal  situation  for  sending  the  information  is  created.  Ideal  means  that  the  signal 
transmitted  and  received  has  not  been  affected  by  non-ideal  components.  Randomness  in  the 
photon  count,  modeled  by  the  use  of  the  Poisson  blocks  in  the  decoder,  will  prevent  a  direct 
comparison.  Therefore,  the  Poisson  blocks  are  removed. 
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As  explained  in  the  Section  6.5,  ahead  of  the  threshold  block  the  two  values  of  the  photon  count 
are  subtracted  and  that  number  is  compared  with  zero  by  the  threshold  block  to  decide  if  the 
output  is  high  (1)  or  low  (0).  For  a  high  output,  the  signal  energy  level  representing  the  ones  in 
the  U;  mask  has  to  be  higher  than  the  signal  energy  level  representing  the  ones  in  the  U;  mask.  The 
higher  the  number  after  subtraction  the  less  interference  the  signal  experiences  from  the  other 
signals.  In  Figure  16  and  Figure  17  it  is  shown  that  the  absolute  values  of  the  signals  before  the 
threshold  comparison  are  larger  for  Walsh  code  masks  than  the  values  obtained  for  the  modified 
PN-sequence  masks.  This  indicates  that  the  Walsh  code  used  yields  better  code  masks  in  the  sense 
that  there  is  a  smaller  likelyhood  of  decoded  errors. 
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Fig.  16  Absolute  value  of  the  signal  before  threshold  comparison  with  a  modified  PN-sequence 
as  code  mask. 
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Fig.  17  Absolute  value  of  the  signal  before  threshold  comparison  with  a  Walsh  code  as  code 
mask 


This  simulation  has  been  run  for  only  one  user.  To  prove  the  fact  that  the  Walsh  code  yields  a 
better  code  masks,  a  simulation  with  1 5  users  has  also  been  run  with  modified  PN-sequences  as 
code  masks  under  the  same  conditions  as  the  reference  case  in  subsection  7.1.  The  results  are 
presented  in  Table  9. 
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Table  9  Results  of  Walsh  codes  and  modified  PN-sequences  as  code  masks 


Data  rate 
(Hz) 

Photodiode 

efficiency 

Filter 

bandwidth 

(Hz) 

Total 

errors 

Average 
error  rate 

Walsh  codes 

100,000 

100,000 

500,000 

0 

0 

Modified  PN-sequences 

100,000 

100,000 

500,000 

1 

0.003 

7.3  Simulation  results  of  the  asynchronous  system 

For  the  asynchronous  system,  only  the  results  of  the  simulations  in  which  the  data  rates  were 
varied  are  presented.  The  way  of  tabulating  the  simulation  results  for  the  asynchronous  system  is 
different  as  seen  in  Table  10.  Each  simulation  has  been  performed  for  100,000  sample  points.  This 
means,  for  every  user  a  different  number  of  bits  is  observed.  Therefore,  the  total  number  of  errors 
are  presented  per  user  and  there  is  no  “Average  error  rate”  column. 

As  explained  in  Section  6.1,  each  user  has  its  own  data  rate.  The  data  rate  parameter  is  set  in  the 
“pre-decoder”  blocks,  and  the  “decoder”  blocks  (impulse  train).  In  Table  10  the  data  rates  of  the 
first  and  the  last  user  are  given.  The  data  rate  increases  by  2500  Hz  per  user.  The  parameters 
photodiode  efficiency  and  filter  bandwidth  are  respectively  100,000  and  500,000  for  each 
simulation. 

In  the  asynchronous  system  users  operate  on  different  data  rates  and  in  the  synchronous  system  all 
users  operate  on  the  same  data  rate.  Comparisons  between  the  two  systems  are  therefore  difficult 
to  make.  However,  it  can  be  concluded  that  increasing  data  rates  result  in  a  larger  number  of  total 
errors,  both  for  the  synchronous  and  asynchronous  system.  Furthermore,  it  can  be  concluded  that 
in  an  asynchronous  system  the  total  number  of  errors  in  the  decoded  output  signal  for  a  certain 
data  rate  is  larger  compared  to  the  corresponding  data  rate  in  the  synchronous  system.  To 
illustrate  this,  consider  the  following  case.  In  the  synchronous  system  operating  on  a  data  rate  of 
150  KHz  a  total  of  three  errors  were  present  in  decoded  output  signals  (See  Table  5).  In  the 
asynchronous  system,  the  one  user  operating  on  a  data  rate  150  KHz  experiences  a  total  of  5 
errors.  The  difference  in  these  results  can  only  be  related  to  the  presence  of  other  users  operating 
at  different  data  rates. 
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Table  10  Results  varying  the  data  rate  in  an  asynchronous  system 


User 

Total  errors 

62,500  -  100,000 

0 

82,500  -  120,000 

9 

102,500 

1 

13 

112,500 

1 

112,500-  150,000 

1 

112,500 

5 

2 

115,000 

6 

3 

117,500 

7 

D 

120,000 

6 

5 

122,500 

5 

6 

125,000 

6 

7 

127,500 

5 

8 

130,000 

5 

9 

132,500 

6 

10 

135,000 

6 

11 

137,500 

8 

12 

140,000 

7 

13 

142,500 

7 

14 

145,000 

6 

15 

147,500 

5 

16 

150,000 

5 
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-  CONCLUSIONS  - 


In  this  report,  simulation  results  on  user  data  detection  under  different  operational  scenarios  in  an 
Optical  Code  Division  Multiple  Access  (O-CDMA)  digital  communication  system  are  presented 
and  the  system  studied  is  explained.  The  simulation  results  obtained,  provide  information  on  the 
performance  of  the  specific  O-CDMA  digital  communication  system  with  16  users.  These  results 
can  be  used  as  a  guide  in  implementing  real  world  realizations  of  an  O-CDMA  digital 
communication  system.  The  number  of  users  present  in  the  O-CDMA  system  under  study  has 
been  limited  to  16,  in  order  to  remain  within  the  hardware  limitations  of  the  computer  system  used 
to  perform  the  simulation  runs. 

Potential  useful  codes  (or  masks),  other  than  modified  maximal  length  PN-codes  have  been 
investigated.  The  Walsh-codes  proved  to  be  easy  to  generate  and  additionally  provide  better 
correlation  properties  when  compared  to  modified  PN-codes. 

Simulation  results  for  both  a  synchronous  and  an  asynchronous  O-CDMA  system  have  been 
presented  in  this  report.  In  the  synchronous  system,  system  parameters  were  exhaustively  varied 
and  their  effect  on  performance  studied.  In  the  asynchronous  system,  the  effect  of  varying  similar 
system  parameters  as  in  the  synchronous  system  produced  correspondingly  equal  performance 
results.  Therefore  the  focus  in  the  asynchronous  system  has  been  the  effect  on  performance  due  to 
different  data  rate  settings  for  the  data  signals  of  the  16  users. 

Noise  effects  on  user  signals’  decoding  performance  have  been  investigated.  White  Gaussian  noise 
has  been  added  in  the  on-off  keyed  modulating  sinusoid  signals  in  the  encoder.  Results  show  that 
Signal  to  Noise  Ratios  smaller  than  6  dB  result  in  errors  in  the  decoded  outputs. 

Performance  results  focused  on  the  total  and  average  number  of  decoded  errors,  presented  in 
tabular  form.  In  general,  results  show  that  for  data  rates  higher  than  100  KHz,  errors  will  occur  in 
decoded  outputs.  Filter  bandwidth  settings  in  the  pre-decoder  to  filter  out  the  individual  frequency 
components  need  to  be  adjusted  with  great  care  in  accordance  with  the  frequency  component 
settings  in  the  encoder.  It  was  found  that  if  filter  bandwidths  are  chosen  too  small,  not  all  the 
energy  associated  with  each  on-off  keyed  modulation  signal  is  captured.  On  the  other  hand,  filter 
bandwidth  settings  chosen  too  large,  causes  picking  up  signal  energy  from  adjacent  channel  users. 
In  both  cases  this  causes  the  accumulation  of  significant  numbers  of  decoded  errors.  This  research 
also  showed  the  need  for  a  suppression  of  leakage  light  (“low”  signal,  representing  no  light 
transmission)  to  levels  in  the  encoder  blocks  that  are  20  dB  below  the  “on”  signal  level,  in  order 
to  guarantee  reasonable  average  error  rates. 
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MM4  MOOEL  EXPERIMENTS  ON  THE  EFFECTS  OF  CLOUD  SHADING 


Chia-Bo  Chang 
Associate  Professor 
Department  of  Geosciences 
Texas  Tech  University 

Abstract 

This  study  examines  the  effects  of  cloud  shading  on  mesoscale  numerical  weather  prediction.  These 
effects  are  evaluated  based  on  real-data  numerical  simulation  experiments  using  the  PSU/NCAR  MM4 
model.  Empirical  formulas  combining  relative  humidity  and  the  dynamically  predicted  cloud  water 
with  a  weighting  factor  were  used  to  determine  cloud  fraction.  Four  simulation  experiments  with 
different  weighting  factors  were  carried  out.  The  structure  of  model  cloud  fraction  varied 
considerably  from  the  experiment  based  on  RH  alone  to  the  experiment  weighted  heavily  on  cloud 
water.  The  RH-based  cloud  fraction  had  a  broad-scale  structure,  while  the  cloud  fraction  associated 
with  cloud  water  showed  small-scale  features. 

Among  all  experiments,  the  1 2-h  model  simulated  surface  flow  and  temperature  fields  quite 
resemble  each  other  despite  the  siiyiificant  structure  changes  in  cloud  fraction.  Some  notable 
differences  are  shown  at  the  24-h  simulations.  These  appear  to  contradict  what  anticipated.  During 
the  daytime  (the  first  twelve  hours),  large  changes  in  cloud  fraction  are  expected  to  alter  solar 
heating  at  the  surface  and  hence  the  circulations,  while  at  night  (the  second  twelve  hours)  cloud 
shading  is  unlikely  to  have  large  influences  on  the  surface  energy  budget.  However ,  the  responses  of 
motion  and  thermal  fields  to  surface  heating  may  vary  from  case  to  case.  Clearly,  more  case  studies 
and  analyses  are  needed  to  understand  the  role  of  cloud  shading  in  the  surface  energy  budget. 

About  the  same  amount  of  non-convective  rainfalls  were  produced  in  all  experiments,  and 
convective  rainfalls  increased  somewhat  with  increasing  weight  for  cloud  water  in  the  cloud  fraction 
equations.  Unfortunately,  at  this  time  we  do  not  have  adequate  data  for  a  quantitative  assessment  of 
precipitation  forecasting.  In  comparison  with  the  daily  weather  map,  which  revealed  very  little 
precipitation  in  eastern  U.S.  during  the  24-h  period,  all  experiments  showed  the  similar  skill. 
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Introduction 


In  the  atmosphere,  weather  systems  develop  in  response  to  horizontal  difference  in  radiative 
heating.  On  the  other  hand,  the  cloud  and  moisture  distribution  associated  with  these  systems 
constantly  modifies  the  radiation  budget.  Clouds,  the  principal  modulator  of  radiation  in  the 
atmosphere,  can  greatly  reduce  the  outgoing  infrared  radiation  and  cooling  at  the  surface.  Also,  a 
substantial  depletion  of  insolation  takes  place  at  the  cloud  top  because  of  its  large  albedo  which,  except 
for  cirrus,  is  on  the  order  of  0.5  (Stephens  and  Webster,  1981).  This  depletion  process  may  not  be 
significant  so  far  as  the  absorption  of  solar  radiation  in  the  atmosphere  is  concerned.  However,  it  can 
have  a  controlling  effect  on  surface  energy  budget  over  land. 

During  the  daytime,  solar  heating  results  in  a  surplus  of  radiative  energy  8t  the  surface.  Some  of 
the  surplus  in  energy  will  be  transferred  from  ground  to  air  in  the  form  of  sensible  and  latent  he8t 
fluxes.  The  intensity  of  sensible  heat  flux  (surface  heating)  is  largely  dependent  upon  the 
characteristics  of  surface  and  the  atmospheric  conditions,  for  example,  cloud  cover.  The  disruption  of 
radiative  heat  transfer  due  to  cloud  shading  can  significantly  influence  the  surface  temperatures  as 
well  as  the  thermal  structure  (e.g. ,  static  stability  and  regional  baroclinicity)  of  the  lower 
troposphere.  Surface  temperature  is  the  essential  element  in  defining  surface  energy  sources  for  the 
development  of  weather  systems  in  numerical  weather  prediction  ( NWP )  models.  Mesoscale  weather 
systems  are  often  sensitive  to  change  in  low-level  static  stability.  Upton  ( 1 993)  showed  that  the 
assimilation  of  cloud  shading  derived  from  satellite  cloud  information  could  improve  the  model 
prediction  of  surface  temperatures.  As  a  result,  the  model  was  able  to  simulate  more  realistic 
mesoscale  circulations.  Chang  ( 1 995)  showed  that  cloud  shading  had  a  direct  impact  on  the  surface 
temperature  forecasting  and  mesoscale  circulations  over  land. 

This  study,  a  continuation  of  the  summer  research  effort  of  the  principal  investigator ,  further 
examines  the  effects  of  cloud  shading  on  short-range  mesoscale  NWP  using  real-data  model  simulation 
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experiments.  The  mesoscale  modeling  system  known  as  MM4  developed  at  the  Pennsylvania  State 
University  ( PSU)  and  National  Center  for  Atmospheric  Research  ( NCAR)  Is  employed  for  the 
experiments. 

The  current  urn  parameterization  of  cloud  fraction  Is  based  on  relative  humidity  ( RH).  A 
modified  approach  involving  the  cloud  water  for  determining  the  model  cloud  fraction  is  formulated  in 
this  study.  It  is  intended  to  test  the  hypothesis  that  the  inclusion  of  dynamically  predicted  cloud  water 
will  improve  the  estimate  of  cloud  fraction  and  consequently  the  model  prediction.  This  study  will 
provide  useful  information  for  a  better  understanding  of  air- land  interaction  in  mesoscale  NWP. 

Modeling  system 

A  brief  overview  of  MM4  system  summarized  from  Gill  ( 1 992)  and  Grell  et  al.  ( 1 994)  is  given  in 
this  section.  The  MM4  dynamic  prediction  model  is  a  three-dimensional  primitive  equation  model  using  a 
o  grid  in  the  vertical  and  a  Cartesian  grid  casting  on  a  Lambert  conformal  projection  in  the  horizontal. 

The  model  basic  equations  are  the  equations  of  motion,  hydrostatic  equation,  first  Isw  of 
thermodynamics,  moisture  conservation  equations  for  water  vapor,  cloud  water,  snd  rainwater.  The  model 
physics  are  as  follows: 

1 )  Parameterization  of  subgrid-scale  moist  convection, 

2)  Dry  convective  adjustment, 

3)  Microphysics  in  the  grid-scale  cloud  and  precipitation  processes, 

4)  Planetary  boundary  layer  (PBL)  processes, 

5)  Atmospheric  radiation  and  surface  energy  budget. 

In  this  study  the  model  initial  state  wss  obtained  from  the  objective  analysis  of  the  National  Weather 
Service  ( NWS)  upper-air  and  surface  observations.  The  National  Meteorological  Center  ( NMC)  gridded 
data  was  used  ss  a  first  guess  in  the  analysis.  The  dynamic  model  used  in  this  study  had  29  levels  with  the 
o  values  ranging  from  1  at  the  surface  to  0  at  the  model  top  (~  50  mb)  and  a  rather  higher  resolution  in 
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the  PBL  (about  1 0  o  levels  below  900  mb).  There  were  50  x  55  grid  points  in  the  horizontal  and  the 
grid  size  was  45  km.  The  time-dependent  conditions  generated  from  the  observations  were  Imposed  on 
the  lateral  boundaries.  Other  numerical  aspects  of  the  model  including  the  horizontal  grid  structure, 
finite  difference  approximation  for  space  8nd  time  derivatives,  and  lower  8nd  upper  boundary 
conditions  were  summarized  In  the  psper  by  Modica  et  al.  ( 1 994).  A  modified  Arakawa- Schubert 
scheme  and  the  associated  microphysics  in  the  model  cloud  Wats'  equation  were  employed. 

In  the  earlier  experiments  (Chang,  1 995),  a  two-way  Interactive  rested  grid  was  used  for  1 2-h 
integration.  The  grid  mesh  sizes  were  45  km  and  15  km,  respectively.  However ,  the  nested  grid  was 
too  much  time-consuming  to  run  and  the  differences  between  the  coarse-mesh  and  fine- mesh  runs 
were  not  very  significant  In  this  study,  only  the  coarse-mesh  model  simulations  with  a  longs'  time 
integration,  24-h  instead  of  1 2-h,  were  performed. 

The  focus  this  study,  of  course,  is  on  how  cloud  shading  alters  the  surface  energy  budget.  The 
surface  energy  budget  involved  incident  solar  and  infrared  irradiance,  sensible  heat  flux  and  latent 
heat  ( moisture)  flux  into  the  atmosphere,  and  heat  conduction  into  the  sub-surface  lays'.  The  surface 
heat  fluxes  were  determined  using  a  bulk-aerodynamical  formulation.  In  the  radiation  computation, 
only  water  vapor  was  considered  as  an  optically  active  gas  in  the  model  atmosphere,  and  radiative 
heating/cooling  in  the  model  atmosphere  was  neglected.  The  ground  temperatures  and  heat  flux  were 
computed  from  the  surface  energy  budget  equation. 

The  cloud  attenuation  of  solar  radiation  and  the  enhancements  of  infrared  radiation  were 
parameterized.  The  reduction  of  insolation  at  the  surface  would  result  in  lower  ground  temperatures 
and  consequently  smaller  sensible  teat  flux  and  moisture  flux.  Three  cloud  layers  ( low,  middle,  and 
high  clouds)  were  considered.  The  original  empirical  formulas  for  cloud  fraction  ( n)  based  on  RH 
were,  respectively, 

n  =  4  RH  -  3,  (1) 

for  the  lower  atmosphere  ( between  o  =  0.96  and  0.75)  and  middle  atmosphere  ( between  o  =  0.75  and 
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0.35),  and 

n  =  2.5  RH  -  1.5,  (2) 

for  the  upper  atmosphere  ( o  i  0.35).  RH  was  the  maximum  relative  humidity  found  in  the  respective 
layers.  Note  that  although  cloud  water  was  predicted  in  the  model ,  cloud  fraction  was  not  directly 
correlated  with  the  model  cloud  water  distribution.  The  evaporation  of  cloud  water  was  invoked  In  the 
cloud  water  equation  if  RH  <  1 .  Hence  the  existence  of  cloud  water  implied  that  the  air  W8S  saturated 
and  n  would  be  1 . 

The  above  empirical  formulas  for  cloud  fraction  were  modified  to  Include  model  cloud  water  for  the 
simulation  experiments  as  described  in  the  following  section. 

Methodology 

The  real-data  model  simulation  experiments  were  conducted  on  the  23-24  August  1 993  case.  The 
experiments  were  aimed  at  a  quantitative  assessment  of  the  role  of  cloud  shading  in  regional 
circulation.  The  case  study  represented  a  weak  synoptic  situation.  No  significant  meteorological  event 
occurred  in  the  United  States  during  the  24-h  period.  This  case  was  selected  for  an  ongoing  satellite- 
model  coupled  analysis  research  at  Atmospheric  Science  Division  .Phillips  Laboratory.  The 
availability  of  satellite  cloud  imageries  could  provide  information  for  a  qualitative  verification  of  the 
model  cloud  cover. 

The  modified  empirical  formulas  for  cloud  fraction  (m)  for  the  lower,  middle,  and  upper  layers  of 
the  model  atmosphere  are-. 

m  =  w  n  0  <  c  <  a, 

m  *  w  n  +  ( 1  -  wV[(c  -  a)/(b  -  a)]  eicib,  (3) 

m  =  w  n  +  ( 1  -  w)  c>b, 

where  c  is  the  layer-mean  cloud  water  in  gm/gm ,  w  is  a  weighting  factor  ranging  from  0  to  1 ,  a  end  b 
are  two  threshold  values.  If  c  <  a,  cloud  water  is  ignored.  The  impact  of  cloud  water  on  cloud  fraction 
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Increases  for  a  i  c  s.  b  aid  levels  off  for  c  >  b  .  For  the  lower  end  middle  atmosphere,  8=10*®  gm/gm 
and  b  =  10”*  gm/gm,  anda=  1 0*6  gm/gm  and  b  =  5x1  O'5  gm/gm,  respectively,  and  n  is  defined  In 
( 1 ),  while  for  the  upper  atmosphere,  8  =  1 0*6  gm/gm  and  b  =  4  x  1 0*s  gm/gm,  aid  n  Is  same  as  that 
defined  In  (2).  The  higher  w  the  heavier  RH  is  weighed  in  (3).  For  w  =  0,  m  is  completely 
determined  by  c,  while  for  w  =  1 ,  m  =  n  and  cloud  fraction  is  a  function  of  RH  only  as  In  ( 1 )  aid  ( 2). 

Consider  the  case  of  equal  weight  (w  =  0.5)  for  c  and  RH  as  8n  example.  If  c  <  a  and  n  -  1,  m  =  0.5 
or  50*  of  cloud  cover.  If  8scsbandn  =  1,0.5imin,and  ifc>bandn=l,m  =  noran 
overcast  sky.  In  other  words,  1 00*  cloud  cover  occurs  only  when  a  relatively  large  amount  of  cloud 
water  is  predicted,  aid  n  is  reduced  to  m  If  the  model  cloud  water  is  less  than  b.  On  the  other  hand,  if  n 
<  1  (e.g.,  0.5)  8nd  c  >  a,  n  is  raised  to  m  ( >  0.5).  Note  that  for  w  <  1  except  that  n  =  1  and  c  >  b  m  is 
always  less  than  1 . 

The  use  of  w  smooths  out  the  sharp  gradient  in  cloud  fraction  between  the  areas  of  cloud/  and  clear 
skies  shown  in  the  earlier  experiments  (Chang,  1 995).  The  threshold  values  a  and  b  are  somewhat 
arbitrary.  The  constant  a  is  more  than  two  orders  of  magnitude  below  the  typical  liquid  water  content 
in  stratus  clouds  ( Rogers,  1 979).  Also,  the  MM4  model  has  been  modified  to  include  cloud  fraction  as 
one  of  the  2-D  variables  stored  in  the  model  output  history. 

The  model  simulation  experiments  conducted  are  as  follows; 

1 )  Experiment  1  ( E 1 )  was  a  24-h  forecast  with  w  =  1 . 

2)  Experiment  2  ( E2)  was  similar  to  E 1  except  that  w  =  0.75. 

3)  Experiment  3  ( E3)  W8S  similar  to  E 1  except  that  w  =  0.5. 

4)  Experiment  4  ( E4)  was  similar  to  E  l  except  that  w  =  0.25 

El  is  regarded  as  a  benchmark  run.  The  model  output  of  El  is  compared  with  observations 
( gridded  fields)  for  a  qualitative  evaluation  of  the  model  performance.  Different  combinations  of  RH 
and  cloud  water  used  to  estimate  cloud  fraction  are  tested  in  E2,  E3,  and  E4.  Comparisons  between  the 
individual  forecast  experiments  sre  made  to  see  which  w  value  may  be  more  appropriate  for  the 
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estimate  of  cloud  fraction  and  consequently  leads  to  a  better  model  performance.  The  variations  In 
surface  energy  sources  due  to  any  changes  In  clow!  fraction  sre  expected  to  have  a  greater  Impact  on 
the  surface  th8n  upper-air  circulations.  Therefore,  a  careful  examination  of  the  model  surface  winds 
and  temperatures,  and  precipitation  is  conducted  for  the  forecast  comparisons. 

Results  of  the  model  simulations 

The  model  was  initialized  at  1 200  UTC  23  August  1 993  for  all  experiments.  Figure  1  shows  the 
model  domain  covering  the  eastern  half  of  the  United  States.  The  Appalachian  Mountains  were  well 
represented  in  the  model  terrain.  Figure  2  shows  the  analyzed  initial  surface  (o  =  0.995, 
approximately  50  m  above  the  ground)  vector  winds  and  temperatures.  The  relatively  large 
temperature  gradient  zones,  one  over  Kansas  and  Missouri  end  the  other  extending  from  Illinois 
southeastward  to  Georgia,  were  associated  with  the  surface  cold  and  wsrm  fronts,  respectively.  The 
wind  field  displayed  some  direction  shifts  across  the  fronts  in  eastern  Kansas  snd  in  the  southeastern 
states.  Also,  the  advection  of  warm  air  from  the  south  toward  the  north-central  states  was  quite 
evident.  The  southern  portions  of  the  fronts  started  to  dissipate  in  the  next  1 2  hours.  The  synoptic 
setting  suggested  little  upper-air  forcing  over  the  eastern  United  States. 

Figure  3  shows  the  1 2-h  simulated  surface  vector  winds  and  temperatures  8t  0000  UTC  2-4  August 
1 993  for  E 1 ,  while  Fig.  4  shows  the  corresponding  observations.  The  observed  temperature  patterns 
indicated  the  eastward  movement  of  cool  air  behind  the  cold  front  toward  Illinois  and  Missouri.  The 
cool  air  pushed  the  northward  intrusion  of  warm  air  to  the  southwest  of  the  warm  front  toward  Indiana 
and  Ohio.  A  major  forecast  error  occurred  in  Illinois  and  Missouri  where  the  model  was  5- 1 0*0 
warmer  thsn  the  observation.  The  observed  cool  air  centered  in  Illinois  was  misplaced  too  far  to  the 
west  and  the  northeastward  intrusion  of  warm  sir  was  not  as  deep  in  the  model.  The  model  performed 
better  in  the  upper  air  ( not  shown);  the  forecast  errors  were  on  the  order  of  1  *C.  The  basic  flow 
patterns  at  the  surface  were  well  predicted.  As  in  the  case  of  temperature,  the  model  overpredicted 
the  wind  speeds  in  Missouri,  Illinois,  and  Indiana.  Minor  errors  included  the  erroneous  northerlies  in 
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southern  Alabama  and  the  easterlies  near  the  west  coast  of  Florida.  Much  better  agreement  with  the 
observations  was  found  over  the  water. 

F igure  5  shows  the  24-h  simulated  surface  vector  winds  and  temperatures  at  1 200  UTC  24  August 
1 993  for  E 1 .  During  the  1 2-h  period,  some  notable  charges  in  the  flow  field  occurred  over  the 
northeastern  states,  and  North  Carolina  where  8  small  anticyclone  formed.  Also,  along  the  coestal 
areas  of  western  Florida,  Alabama,  and  Mississippi ,  there  were  clear  increases  in  the  off-shore  flows 
resembling  a  land-breeze  circulation.  The  1 2-h  changes  in  the  temperature  field  were  quite  drastic. 
Over  laid  there  were  warming  near  the  northern  boundary  and  cooling  in  other  regions  ranging  from 
about  1 0*C  over  Missouri  to  a  few  degrees  in  the  coastal  arees. 

Figure  6  shows  the  1 2-h  and  24-h  simulated  cloud  fractions  for  E 1 .  There  was  extensive  low- 
cloud  cover  over  the  eastern  states.  Between  1 2  h  and  24  h  the  low  aid  middle  clouds  decreased 
considerably  in  the  west,  while  an  increase  in  high  cloud  in  the  northeast  was  observed.  Superposing 
the  three  layers  of  clouds,  we  notice  rather  cloudy  conditions  in  the  east  at  both  times.  Fig.  7  shows 
the  24-h  accumulated  total  and  convective  precipitation.  Most  rainfalls  were  non-convective  except 
over  the  Orest  Lakes  area.  This  was  consistent  with  the  increase  of  high  clouds  to  the  east  of 
convection. 

As  pointed  out  by  Chang  ( 1 995)  earlier  the  large  errors  in  the  surface  temperature  forecast  In  E 1 
over  Illinois  and  Missouri  appeared  to  be  caused  by  too  much  surface  heating.  It  was  noted  that  the 
model  failed  to  reproduce  the  light  rainfall  (about  half  inch  in  24  h  as  revealed  In  the  dally 
weather  maps)  In  Illinois  and  Missouri.  The  dry  surface  In  conjunction  with  little  cloud  cover 
contributed  to  the  high  ground  temperatures  and  too  strong  surface  heating  In  the  model. 

Figure  8  shows  the  1 2-h  and  24-h  simulated  cloud  fraction  for  E2.  The  basic  patterns  aid  areas 
of  cloud  cover  sre  similo'  to  those  of  E 1 .  But  for  some  areas  where  the  model  did  not  predict  cloud 
water  the  sizes  of  cloud  fraction  were  somewhat  reduced.  On  the  other  hand,  the  areas  with  predicted 
cloud  water ,  for  example  off  the  east  coast  of  Florida,  we  see  clear  changes  in  cloud  fraction.  The  24-h 
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simulated  total  and  convective  precipitation  ( not  shown)  are  very  similar  to  those  of  E 1 .  Also,  there 
are  no  significant  differences  in  the  predicted  surface  temperatures  and  winds  between  E 1  and  E2. 

Figure  9  shows  the  1 2-h  and  24-h  simulated  cloud  fraction  for  E3.  The  use  of  w  =  0.5  greatly 
altered  the  structure  of  cloud  fraction.  Clouds  oriented  in  lines  were  embedded  in  the  broad-scale 
patterns.  The  structure  suggests  that  in  the  middle  and  higher  atmosphere  the  model  predicted  little 
cloud  water  over  the  western  half  of  the  domain.  Figs.  1 0  and  1 1  show,  respectively,  the  1 2-h  and 
24-h  model  surface  vector  winds  and  temperatures  for  E3.  At  1 2  h  E 1  and  E2  were  very  similar  at 
the  surface  level.  At  24  h,  over  the  eastern  U.S. ,  E3  was  slightly  cooler  than  E 1 .  Some  differences 
from  El  in  the  surface  winds  were  found  over  the  southeastern  states  (e.g. ,  Tennessee  and  North 
Carolina).  Fig.  12  shows  the  24- h  simulated  total  and  convective  precipitation  for  E3.  The  enhanced 
maximums  in  western  Virginia  and  at  the  west  coast  of  Florida  did  not  exist  in  E 1 .  Both  were  a  result 
of  convection. 

Figure  13  shows  the  1 2-h  and  24-h  simulated  cloud  fraction  for  E4.  There  are  striking 
discrepancies  in  cloud  fraction  between  El  and  E4.  The  areas  of  high  cloud  fraction  based  on  RH  do  not 
agree  well  with  those  according  to  cloud  wster.  The  24-h  total  precipitation,  surface  temperatures, 
8nd  winds  ( not  shown)  resemble  those  of  E3. 

Summary 

Based  on  the  real-data  model  simulation  experiments  Chang  ( 1 995)  showed  that  the  effects  of 
cloud  shading  should  not  be  overlooked  in  short-term  mesoscele  NWP.  Without  cloud  shading  the  model 
surface  temperatures  were  too  high  and  consequently  the  model  atmosphere  became  less  stable  causing 
erroneous  convection.  To  resolve  the  essentials  of  shading  effects  in  the  model  an  accurate  estimate  of 
cloud  fraction  is  needed. 

In  most  NWP  models,  cloud  fraction  is  parameterized  based  on  RH.  In  this  stud/,  however, 
empirical  formulas  involving  both  RH  and  the  dynamically  predicted  cloud  water  were  used  to 
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determine  cloud  fraction.  It  was  intended  to  reveal  whether  these  new  formulas  might  provide  a  better 
estimate  of  cloud  fraction  and  consequently  improve  the  model  performance.  Four  real-data  model 
experiments  using  the  formulas  with  different  weighting  factor  for  RH  and  cloud  water  were  carried 
out.  The  structure  of  model  cloud  fraction  varied  considerably  from  the  experiment  based  on  RH  alone 
to  the  experiment  weighted  heavily  on  cloud  water.  The  RH-based  cloud  fraction  had  a  broad-scale 
structure,  while  the  cloud  fraction  associated  with  cloud  water  showed  small-scale  features. 

Among  all  experiments,  the  1 2-h  model  simulated  surface  flow  and  temperature  fields  quite 
resemble  each  other  despite  the  siiyiificsnt  structure  changes  in  cloud  fraction.  Some  notable 
differences  sre  shown  at  the  24-h  simulations.  These  appear  to  contradict  what  anticipated.  During 
the  daytime  ( the  first  twelve  hours),  large  changes  In  cloud  fraction  are  expected  to  alter  solar 
heating  at  the  surface  and  hence  the  circulations,  while  at  night  (the  second  twelve  hours)  cloud 
shading  is  unlikely  to  have  large  influences  on  the  surface  energy  budget.  However ,  the  responses  of 
motion  and  thermal  fields  to  surface  heating  ms/  vary  from  case  to  case.  Clearly,  more  case  studies 
and  analyses  sre  needed  to  understand  the  role  of  cloud  shading  In  the  surface  energ/  budget. 

About  the  same  amount  of  non-convectlve  rainfalls  were  produced  In  all  experiments,  and 
convective  rainfalls  increased  somewhat  with  increasing  weight  for  cloud  water  in  the  cloud  fraction 
equations.  Unfortunately,  at  this  time  we  do  not  have  adequate  data  for  a  quantitative  assessment  of 
precipitation  forecasting.  In  comparison  with  the  daily  weather  map,  which  revealed  very  little 
precipitation  in  eastern  U.S.  during  the  24-h  period,  all  experiments  showed  the  similar  skill. 
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Fig.  1.  The  model  domain  snd  terrain  (m) 
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Fig.  2.  The  model  initial  surface  vector  winds  ( knots)  and  temperatures  ( 6C)  for  E 1  at  1 200  UTC 
23  August  1993. 
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Fig.  3.  The  1 2-h  simulated  model  surface  vector  winds  (knots)  and  temperatures  (°C)  for  E 1  at 
OOOO  UTC  2-4  August  1993. 
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Fig.  10.  As  in  Fig.  3  except  for  E3 


Fig.  \  1 .  As  in  Fig.  5  except  for  E3 


Miniature  Laser  Gyro  Consisting  in  a  Pair  of  Unidirectional  Ring  Lasers 


Jean-Claude  Diels 
Professor 

Department  of  Physics  and  Astronomy 


University  of  New  Mexico 
Albuquerque,  NM  87131 


Final  Report  for: 

Summer  Research  Extension  Program 
Phillips  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base 
Washington,  D.C. 

and 

Phillips  AFB 


December  1996 


Miniature  Laser  Gyro  Consisting  in  a  Pair  of  Unidirectional  Ring  Lasers 


Jean-Claude  Diels 
Professor 

Department  of  Physics  and  Astronomy 
University  of  New  Mexico 
Albuquerque,  NM  87131 


ABSTRACT 

Very  thin  (compared  to  the  wavelength)  gain  and  dielectric  layers  have  similar  reflection 
properties,  except  for  a  phase  factor  of  n/2.  This  particular  difference  makes  it  possible  to  design 
non-reciprocal  elements,  having  a  zero  reflectivity  from  one  direction,  and  a  finite  reflectivity  from 
the  other  direction.  Unidirectional  operation  of  a  ring  laser,  based  on  this  property,  has  been 
demonstrated. 
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MINIATURE  LASER  GYRO  CONSISTING  IN  A  PAIR 
OF  UNIDIRECTIONAL  RING  LASERS 

Jean-Claude  Diels 


Objective 

The  short  term  objective  was  successfully  achieved,  i.e.  is  to  demonstrate  unidirectional  operation  of  a  ring  laser  with 
a  free  standing  multiple  quantum  well  structure  as  its  gain  medium. 

An  extension  of  this  program  could  lead  to  a  long  term  objective,  which  is  to  demonstrate  gyro  response  of  a  dual 
ring  laser  operating  on  a  small  cube  of  semiconductor  material. 

Principle  of  “non-reciprocity” 

The  principle  of  the  asymmetry  in  reflection  of  the  structure  can  best  be  understood  by  considering  just  a  pair  of  two 
ultrathin  (thickness  C  wavelength)  reflecting  layers,  one  being  a  dielectric  “delta  function”,  and  the  other  a  pure  gain 
discontinuity.  This  ultra-simplified  pair  provides  a  base  of  understanding  for  the  more  complex  multilayer  structure  of 
the  MQW.  We  will  consider  successively  single  dielectric  layers,  single  gain  layers  (imbedded  in  a  dielectric  with  the 
same  real  part  for  the  dielectric  constant),  pairs  of  layers,  and  finally  more  complex  structures. 

Because  one  is  used  to  deal  with  continuous  structures  of  dimension  larger  than  the  wavelength,  optical  properties  of 
such  structures  are  counter-intuitive.  For  instance,  the  usual  concept  of  a  “self-saturation”  and  a  “mutual  saturation” 
does  not  apply  to  the  ultrathin  gain  medium.  The  saturation  is  uniquely  determined  by  the  value  of  the  local  field  at 
the  layer,  which  could  be  the  node  or  antinode  of  a  standing  wave.  Unlike  the  situation  in  a  continuous  gain  medium, 
the  field  reflection  of  the  single  gain  layer  is  comparable  to  the  field  gain  in  transmission. 

Single  layer 

Let  us  consider  first  a  single  dielectric  slab  imbedded  in  a  uniform  dielectric  medium,  excited  by  a  linearly  polarized 
plane  wave  with  electric  field  amplitude: 

E(t)  =  E(t)  +  c.c.  =  +  c.c.  (1) 

The  slab  being  made  of  a  layer  of  dipoles  with  polarizability  xeo?  the  total  polarization  is  P  =  x&{z)eoE.  Inserting 
this  polarization  as  a  source  term  in  Maxwell’s  equations: 

-  +  k2E  =  -noto  u}xS{z)E  =  (2) 
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We  will  use  o,  t  and  t  as  indices  for  the  incident,  reflected  and  transmitted  waves,  respectively.  Continuity  of  the 
tangential  component  of  the  field  across  the  dielectric  layer  imposes  that: 


£o  +  =  (1  +  r)£o  —  St  =  t£o- 

Integrating  Maxwell’s  equation  (2)  across  the  dielectric  layer: 


- ik£t 


Taking  into  account  the  relation  (3),  we  find: 


-ik£ o  +  ik£r 


= 


X^o(l  +  r). 


1  +  r  '2nc 

where  n  is  the  index  of  refraction  of  the  host  medium.  Solving  for  the  reflection  and  transmission  coefficient: 

-ip 


r  = 


*  = 


1  +  iP 
1 


(3) 

(4) 

(5) 

(6) 
(7) 


1  +  ip' 

Since  p  is  proportional  to  the  susceptibility,  for  a  gain  medium,  we  will  have  a  reflection  and  transmission  of  the 
form: 


r 

t 


a 

1  —  a 
1 

1  —  a' 


(8) 

(9) 


These  two  expressions  apply  as  well  for  an  absorbing  medium,  and  have  been  checked  numerically  using  matrix 
multiplication  methods.  While  the  agreement  is  excellent  for  small  values  of  a,  there  is  some  discrepancies  for  a  >  0.1, 
which  is  of  no  concern  in  practical  cases,  but  points  to  some  hidden  approximation  in  substituting  a  S  function  for 
the  polarizability.  It  is  interesting  to  note  the  symmetry  between  forward  and  backward  directions  in  the  case  of  an 
absorbing  or  amplifying  layer.  For  any  value  of  a,  the  field  reflection  is  equal  to  the  field  gain  (attenuation)  in  the 
forward  direction.  This  result  is  consistent  with  a  simple  argument  that  the  emitting  layer  being  much  thinner  than 
the  wavelength  has  no  information  on  the  direction  of  the  field,  hence  has  equal  probability  of  emission  in  forward  and 
backward  directions.  Again  consistent  with  the  semiclassical  approximation,  in  a  gain  (absorbing)  layer  or  volume  of 
dimensions  much  smaller  than  the  wavelength,  the  local  dipole  induced  by  the  light  has  no  space  dependence,  hence 
cannot  have  any  “traveling  wave”  dependence.  In  the  case  of  a  macroscopic  volume  as  a  laser  gain  medium,  atoms 
at  different  positions  along  z  are  excited  in  phase,  hence  the  radiation  in  the  forward  direction  has  the  same  k  vector 
as  the  exciting  wave,  while  the  radiation  in  the  backward  direction  averages  out.  Note  that  it  is  the  field  gain,  rather 
than  the  gain  in  energy,  that  is  symmetric.  In  terms  of  the  reflected  and  transmitted  energy,  the  reflection  is  second 
order  with  respect  to  the  transmission  gain. 
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As  in  the  case  of  the  dielectric  layer,  it  is  the  basic  property  that  the  tangential  component  of  a  field  is  continuous 
across  an  interface  that  has  as  consequence  that  the  reflected  field  is  equal  to  the  gain  in  transmitted  field.  One  is 
used  to  associate  the  gain  (absorbing)  medium  with  a  discontinuity  in  field  (i.e.,  a  larger  field  at  the  output  of  the 
medium  than  at  the  input).  This  view  is  incorrect  for  layers  thinner  than  the  wavelength.  For  the  amplifying  thin 
layer,  the  dipolar  radiation  emits  on  both  sides  of  the  plane.  For  the  absorbing  layer,  the  induced  dipoles  make  a 
“sink”  for  the  field,  bringing  it  down  equally  on  both  sides  of  the  layer. 

It  is  a  peculiar  coincidence  that  the  quantum  field  description  of  stimulating  emission  of  a  single  atom,  being  emitted 
exclusively  in  the  direction  of  the  exciting  mode,  is  consistent  with  the  macroscopic  limit  of  the  semiclassical  theory, 
and  not  the  microscopic  one.  Quantum  electrodynamics  cannot  easily  be  extended  from  a  single  atom  consideration 
to  an  atomic  layer. 


The  non-reciprocal  element 

The  pairing  of  a  thin  gain  layer  with  a  dielectric  reflecting  layer  leads  to  a  structure  that  has  non  symmetric  reflection 
properties.  Let  us  consider,  from  left  to  right,  a  thin  gain  layer  (transmission  E%  =  (1  +  a)Ei]  reflection  Er  =  a/;) 
followed  at  a  distance  £  =  3 A/8  by  a  dielectric  layer  (reflection  Er  ~  irE{\  transmission  E%  =  y/\  —  r2J3j).  The  round 
trip  between  the  two  layers  corresponds  to  a  phase  factor  exp{— 2 ik£}  ~  i.  Let  us  neglect  for  simplicity  multiple 
reflections.  The  combined  reflection  from  both  interfaces,  for  a  beam  incident  from  the  left,  is:  Er  =  (a  —  r)Ei ,  which 
is  zero  if  a  =  r.  For  a  beam  incident  from  the  right:  Er  =  i(a  +  r)U;,  which  is  finite  if  a  =  r. 

A  “robust”  Layer  structure 

The  properties  of  non- reciprocity  in  reflection  can  be  generalized  to  more  complex  multiple  quantum  well  structures. 
Standard  matrix  calculation  is  used  to  calculate  the  transmissivity  and  reflectivity  of  a  multiple  quantum  well  structure. 
Since  the  GaAs  substrate  is  opaque  to  the  design  wavelength  of  this  laser  (860  nm),  it  will  have  to  be  etched  away. 
From  both  sides,  the  “nonreciprocal  gain  structure”  is  bounded  by  an  antireflection  stack.  It  is  preferable  to  have 
the  AR  coating  included  in  the  multilayer  structure  grown  by  MBE  or  MOCVD,  rather  than  to  have  an  additional 
manufacturing  step. 

The  layer  structure  is  designed  for  maximum  reflectivity  from  one  side,  zero  reflectivity  from  the  other  side.  There  is 
however  an  additional  condition  of  the  phase  shift  upon  reflection,  for  the  reflecting  side.  This  condition  is  dictated  by 
the  potential  gyro  application.  The  performances  of  a  laser  gyro  at  low  speed  are  affected  by  backscattering.  At  each 
round  trip,  some  small  fraction  of  the  radiation  of  —  for  instance  —  the  clockwise  propagating  beam  e£ 1  exp {i[ut  —  kz]} 
is  reflected  into  the  cavity  of  the  counterclockwise  propagating  beam.  If  the  phase  of  the  backscattering  coefficient 
e  is  such  that  the  reflection  has  a  component  in  phase  with  the  counterclockwise  field  £2  exp{i[u>t  +  ^he  latter 
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will  be  “injection  locked”  by  the  backscattering.  There  is  no  injection  locking  if  the  backscattering  is  90°  out  of  phase 
with  the  field  £ 2  • 

For  use  in  a  ring  laser,  it  is  essential  that  the  phase  of  the  reflected  wave  be  at  90°  from  that  of  the  transmitted 
wave.  This  phase  relationship  will  prevent  locking  of  the  laser  gyro. 

Let  us  consider  a  structure  such  as  detailed  in  table  1  inserted  in  a  ring  cavity,  with  the  clockwise  beam  incident 
first  upon  the  face  of  zero  reflection.  Let  be  the  phase  shift  upon  transmission,  and  ipr  the  phase  shift  upon  reflection 
of  the  second  surface.  The  resonance  condition  for  the  ring  cavity  is  that  the  phase  shift  for  the  light  path  outside  the 
MQW  structure  be  - varphit .  The  condition  that  counterclockwise  wave  reflected  at  the  right  interface  be  90°  out  of 
phase  with  the  transmitted  phase  is: 

(~Vt)  +  Vr  =  +  ~ 

_  7T 

<pr-2<pt  =  -. 

(10) 

The  last  condition  is  that  of  “robustness”.  Given  an  error  on  any  of  the  growth  parameter  (error  bar  on  index  of 
refraction,  layer  thickness  and  composition),  the  change  in  structure  properties  should  be  negligible.  This  “robustness” 
of  the  structure  was  found  to  vary  over  wide  ranges  depending  on  the  particular  choice  of  parameters. 

There  is,  at  this  point,  no  systematic  method  to  search  simultaneously  for  zero  reflection  from  one  direction, 
maximum  reflection  from  the  other,  an  optimum  phase  shift  upon  reflection  and  transmission,  a  maximum  gain,  and  a 
robust  structure.  The  gain  in  the  quantum  wells  is  proportional  to  the  imaginary  index.  First,  the  resonant  wavelength 
had  to  be  determined  for  a  GaAs  quantum  well  layer.  Knowing  the  thickness  and  the  A1  content  in  the  barrier  region, 
a  g°°d  estimate  of  the  resonant  wavelength  can  be  determined  by  calculating  the  energy  levels  of  the  quantum  well. 
In  the  actual  gain  region,  the  quantum  wells  are  separated  by  half  a  wavelength.  This  has  been  shown  to  increase  the 
overlap  of  the  anti-nodes  of  the  internal  local  electric  fields  with  the  individual  quantum  wells  [1],  The  anti-reflection 
layers  consist  of  a  half  wavelength  layer  followed  by  alternating  quarter  wavelength  layers  of  AlAs  and  AlGaAs.  The 
asymmetry  in  the  structure  is  introduced  with  the  “dielectric  layers”,  which  are  the  four  (on  one  side)  and  six  (on  the 
other  side)  separating  the  gain  structure  from  the  antireflection  structure. 

It  should  be  noted  that  this  is  a  unique  type  of  vertical  cavity  laser  because  the  absence  of  heat  dissipating  substrate. 
Becausee  the  GaAs  substrate  has  to  be  etched  off,  all  the  heat  dissipated  in  the  gain  layer  has  to  be  conducted  away 
transversally  through  the  very  thin  structure.  To  maximize  heat  dissipation  and  provide  mechanical  support,  the 
thickness  of  the  layers  hording  the  gain  region  was  chosen  to  be  in  excess  of  one  micron. 

The  values  for  the  refractive  index  are  taken  from  the  data  in  [2]. 

There  is  an  optimum  gain  (imaginary  index)  to  attain  this  zero  reflectivity  in  the  CW  direction.  For  values  other 
than  the  gain  used  in  these  calculations,  the  reflectivity  increases  on  either  side  of  this  optimum  value.  A  graph  of  the 
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material 

thickness  (nm) 

index  (r) 

index  (i) 

comment 

air 

1. 

0.000 

boundary 

AZo.25Gao.75  As 

126.30 

3.3967 

0.000 

stop  etch  layer 

AlAs 

71.90 

2.9835 

0.000 

Alo.2bGao.75  As 

63.15 

3.3967 

0.000 

AlAs 

71.90 

2.9835 

0.000 

AR 

Alo.25Gao.75  As 

63.15 

3.3967 

0.000 

AlAs 

71.90 

2.9835 

0.000 

AR 

sAlo.25Gao.75As 

63.15 

3.3967 

0.000 

AlAs 

71.90 

2.9835 

0.000 

AR 

Alo.2bGao.75  As 

41.00 

3.3967 

0.000 

reflector 

AlAs 

85.00 

2.9835 

0.000 

reflector 

Alo.2bGao.75  As 

59.00 

3.3967 

0.000 

spacer  r —  a 

AlAs 

76.62 

2.9835 

0.000 

(2  layers) 

Alo.2bGao.75  As 

59.00 

3.3967 

0.000 

spacer  r  —  a 

AlAs 

76.62 

2.9835 

0.000 

(2  layers) 

Alo.2bGao.75  As 

1108.78 

3.3967 

0.000 

spacer  r  —  a 

GaAs 

Quantum  well 

Gao. so  A/0,20  As 

; 

pump  layer 

GaAs 

if!  il 

3.6470 

-0.014 

Quantum  well 

Gao.soAlo.2oAs 

115.00 

3.4130 

0.000 

pump  layer 

. . . 

19  pairs  total  between - 

GaAs 

10.00 

3.6470 

-0.014 

Quantum  well 

Gao.8oAlo.2oAs 

115.00 

3.4130 

0.000 

pump  layer 

GaAs 

10.00 

3.6470 

-0.014 

Quantum  well 

Gao. go  A/0.20  As 

115.00 

3.4130 

0.000 

pump  layer 

GaAs 

10.00 

3.6470 

last  Quantum  well 

Alo.2bGao.7bAs 

1108.78 

3.3967 

0.000 

spacer 

AlAs 

76.62 

2.9835 

0.000 

(1  layer) 

Alo.2bGao.7b  As 

59.00 

3.3967 

0.000 

spacer 

AlAs 

76.62 

2.9835 

0.000 

(1  layer) 

Al  o.2b  Gao. 75  As 

35.50 

3.3967 

0.000 

reflector 

AlAs 

71.90 

2.9835 

reflector 

Al 0.25  Gao. 7b  As 

63.15 

3.3967 

AR 

AlAs 

71.90 

2.9835 

0.000 

AR 

Alo.2bGao.7bAs 

63.15 

3.3967 

0.000 

AR 

AlAs 

71.90 

2.9835 

0.000 

AR 

Alo.25Gao.75As 

63.15 

3.3967 

0.000 

AR 

AlAs 

71.90 

2.9835 

0.000 

AR 

Alo.25Gao.75As 

126.30 

3.3967 

0.000 

etch  layer 

AlAs 

10.00 

2.9835 

0.000 

1st  etch 

GaAs 

3.6470 

0.000 

substrate 

s 

Table  1:  Detailed  constitution  of  the  AlGaAs  MQW  structure  optimized  for  zero  reflection  at  858  nm,  and  maximum  robustness. 
Note  the  two  layers  of  1108  nm  for  heat  sinking  and  mechanical  rigidity. 
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relfectivity  as  a  funstion  of  gain  is  shown  in  Fig.  3 


Wavelength  (nm) 

Figure  1:  reflection  spectrum  for  light  incident  from  the  right  (dotted  line)  or  the  left  (solid  line)  of  the  structure  of  table  L 


Figure  2:  Transmission  spectrum  of  the  non-reciprocal  structure. 


The  sample 

One  sample  was  grown  at  the  CHTM  by  MOCVD.  The  properties  of  that  sample  did  not  bare  any  ressemblance  with 
the  prediction.  In  fact,  the  sample  had  luminescence  at  960  nm  instead  of  860,  which  cannot  even  be  explained  by  the 
sloppiness  of  the  CHTM  crystal  grower. 

Another  sample  was  grown  by  MBE  at  Sandia  Laboratories  Etching  of  the  substrate  was  made  up  to  the  last  AlAs 
“stop-etch”  layer.  However,  a  last  slow  etch  was  made  to  eliminate  the  AlAs  layer  up  to  the  Alo.25Gao.75As  layer, 
which  is  not  so  easily  oxydized  as  AlAs.  The  diameter  of  the  etched  zone  was  kept  as  small  as  possible,  compatible 
with  the  size  of  the  pump  and  laser  beams.  For  better  heat  removal,  the  etched  sample  was  mounted  on  a  sapphire 
window  that  has  a  hole  drilled  in  it  to  allow  passage  of  the  beam. 
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Figure  3:  Reflectivity  in  the  CW  at  858  nm,  as  a  function  of  the  imaginary  index  (gain) 


The  unidirectional  MQW  laser 

Laser  cavity 


Figure  4:  MQW  laser  cavity. 

Unidirectional  operation  is  expected  for  a  ring  laser  in  which  the  non-reciprocal  MQW  structure  is  used  as  a  gain 
medium.  The  MQW  structure  is  pumped  with  radiation  from  a  dye  laser  (LDS  698  dye)  which  had  a  maximum  gain 
in  the  700  -730  nm  region.  This  range  corresponds  to  the  optimum  pumping  range  for  the  GaAs/AlGaAs  quantum 
wells.  The  maximum  pump  power  was  300  mW. 

The  ring  laser  cavity  is  sketched  in  Fig.  4)).  Basically  there  are  three  mirrors  that  are  configured  to  make  a 
triangular  ring  cavity  and  two  micro-lenses  to  focus  the  light  into  both  sides  of  the  MQW  sample.  The  sample  is  being 
pumped  collinearly  through  a  dichroic  mirror.  Since  heat  dissipation  is  transverse,  the  transverse  dimension  of  the 
gain  medium  is  also  kept  to  a  minimum  (estimated  spot  size  of  the  pump  beam  on  the  sample:  «  20/xm).  The  lenses 
are  the  most  critical  elements  of  the  cavity.  Because  of  the  tight  focusing  (/  =  4  mm),  their  curvature  is  strong,  and 
it  becomes  difficult  to  minimize  the  reflections  losses  at  the  four  surfaces  that  are  traversed  at  each  round-trip.  The 
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lenses  used  in  the  experiment  were  GelTech  (originally  Corning)  aspheric  micro  lenses  with  a  focal  length  of  about  4 
mm  and  a  working  distance  of  about  2  mm,  with  “V  coat”  antireflection  coating  (guaranteed  specification  <  0.25% 
per  surface,  target  specification  as  low  as  0.10%  per  surface). 

The  mirrors  were  selected  from  VLOC,  with  high  reflection  coefficient  (>  99.9%)  centered  around  860  nm  and 
partially  transmitting  (80%)  around  720  nm.  Since  the  pump  beam  was  fairly  strong  (a  few  hundred  milliwatts),  a 
20%  loss  in  pump  transmission  is  acceptable.  The  mirrors  were  flat  and  designed  for  30  degrees  incidence. 


Figure  5:  Unidirectional  lasing  in  the  CCW  direction  just  above  threshold 


cw 

Pump  Power  =  90  mW 


Wavelength  (nm) 


Figure  6:  Lasing  suppressed  in  the  CW  direction. 


Sample  mounting 

In  order  to  increase  the  heat  dissipation,  the  sapphire  window  is  then  placed  into  a  copper  block  which  is  mounted 
onto  a  thermolelectric  cooler.  This  Peltier  cooler  is  hooked  up  to  a  Wavelength  Electronics  temperature  controller  and 
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ccw 

Pump  Power  =  96  mW 


845  050  085  080  005  870 

Wavelength  (nm) 


Figure  7:  Stronger  lasing  in  the  CCW  direction  at  a  slightly  higher  pump  power. 


cw 

Pump  Power*  95  mW 


s/V*Vl 


845  850  055  080  M5  070 

Wavelength  (nm) 


Figure  8:  The  onset  of  weak  lasing  in  the  CW  direction. 

driven  by  a  Lamba  power  supply  which  limits  the  current.  The  controller  can  stabilize  the  temperature  to  about  0.01 
C  and  can  cool  the  apparatus  down  to  the  dewpoint.  On  most  days  here  in  this  very  dry  state,  the  sample  can  be 
cooled  to  about  18  C  and  condensation  can  be  avoided.  Heating  of  the  MQW  sample  is  further  reduced  by  introducing 
a  duty  cycle  for  the  pump  beam,  with  a  chopper  wheel  with  a  small  ratio  of  opening/closure  time.  A  50%  duty  cycle 
was  sufficient  to  achieve  lasing  without  excessive  thermal  load  on  the  sample. 

Unidirectional  operation 

Lasing  was  determined  by  observing  a  large  spike  in  the  spectrum  around  857  nm.  The  design  gain  per  pass  is  about 
1.6%.  Since  the  total  losses  due  to  all  of  the  elements  in  the  cavity  per  round  trip  is  between  1.2%  and  1.4%,  the 
alignment  is  obviously  quite  critical. 

The  sample  is  placed  in  the  cavity  with  the  etched  side  towards  the  pump  beam.  According  to  the  design,  the 
“zero”  reflection  is  on  the  other  side  of  the  etched  hole.  A  beam  incident  from  that  side  has  a  smaller  reflection  than 
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Figure  9:  Ratio  of  the  peak  intensities  (CCW/CW). 

a  beam  incident  on  the  side  that  was  etched.  The  lasing  wavelength  is  between  856  nm  and  857  nm  which  happens  to 
be  very  close  to  the  design  wavelength  of  the  structure. 

Unidirectionality  is  observed  at  low  pump  powers  in  the  CCW  direction  with  lasing  suppressed  in  the  CW  direction 
(see  figures  (5)  and  (6)).  As  the  pump  power  is  turned  up,  the  MQW  starts  lasing  in  the  CW  direction,  but  the  intensity 
was  still  less  than  the  beam  going  CCW  (see  figures  (7)  and  (8)).  The  linewidth  of  the  laser  peaks  increases  with 
the  power.  Close  to  threshold  (~90  mW),  the  linewidth  at  full  width  half  maximum  (FWHM)  is  measured  to  be 
around  0.5  nm,  which  is  the  resolution  of  the  spectrometer.  The  degradation  of  unidirectionality  with  pump  power  is 
expected,  since  the  design  “zero  reflection  gain”  corresponds  to  threshold  —  or  even  below  (see  figure  (3)). 

The  ratio  of  the  peak  intensity  as  a  function  of  pump  power  for  the  CCW  beam  to  the  CW  beam  is  shown  in 
figure  (9).  Above  threshold,  the  ratio  gets  progressively  smaller;  which  is  consistent  with  the  theoretical  curve  where  the 
smaller  reflectivity  occurs  at  an  optimum  gain,  with  this  reflectivity  increasing  as  the  gain  is  increased.  By  decreasing 
the  differential  losses,  this  increase  in  the  reflectivity  lowers  the  effectiveness  of  the  device  as  a  non- reciprocal  structure. 


A  laser  cube  gyro 

The  unidirectionality  property  demonstrated  above  can  be  exploited  in  a  miniature  ‘cube”  laser  gyro,  consisting  of 
two  90°  turning  prisms  with  the  MQW  gain  structure  deposited  on  the  hypothenuse  face  of  one  prism.  The  two  prisms 
are  optically  contacted  to  form  a  cube  (Fig.  10).  Depending  of  the  location  of  the  pump  spot  (indicated  by  an  arrow), 
the  lasing  will  be  clock-wise  or  count erclock- wise  in  the  cross-section  of  the  cube.  With  two  pump  spots  at  different 
height  in  the  prisms,  lasing  can  occur  in  opposite  direction  in  adjacent  planes.  Since  there  is  no  coupling  between  the 
cavities,  the  beat  frequency  between  the  two  lasing  planes  should  exhibity  a  perfect  gyroscopic  response.  The  beams 
can  be  extracted  from  the  laser  by  frustrated  total  reflection  with  an  auxiliary  prism,  as  indicated  on  the  figure. 
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Figure  10:  The  cubic  laser  gyro.  Two  90°  turning  prisms  constitute  the  two  halves  of  the  cube.  The  nonreciprocal  layer 
structure  is  grown  on  the  hypothenuse  plane  of  one  of  the  cubes.  The  two  prisms  are  optically  contacted.  The  laser  is  pumped 
at  two  different  heights,  resulting  in  unidirectional  operation  in  opposite  direction  in  two  parallel  planes. 

References 

[1]  M.  Y.  Raja,  S.  R.  Brueck,  M.  Osinski,  C.  F.  Shaus,  J.  G.  Mclnerney,  T.  M.  Brennan,  and  B.  E.  Hammons. 
Resonant  periodic  gain  surface  emitting  semiconductor  lasers.  IEEE  J.  Quantum  Electron .,  25:1500-1511,  1989. 

[2]  A.  Aspnes,  S.  M.  Kelso,  R.  A.  Logan,  and  R.  Bhat.  Optical  properties  of  AlGaAs.  J.  Appl.  Phys ,  60:754-766, 
1986. 


4-13 


TWO-DIMENSIONAL  SIMULATIONS  OF  ULTRAFAST  SILICON  AVALANCHE 
SHAPER  (SAS)  DEVICES  FOR  HIGH  POWER  SWITCHING 


R.  P.  Joshi 
Associate  Professor 

Department  of  Electrical  and  Computer  Engineering 


Old  Dominion  University 
Norfolk,  VA  23529-0246 


Final  Report  for: 

Summer  Faculty  Research  Program 
Phillips  Laboratory,  Kirtland  Air  Force  Base 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base,  DC 

and 

Phillips  Laboratory 


December  1996 


5-1 


TWO-DIMENSIONAL  SIMULATIONS  OF  ULTRAFAST  SILICON 
AVALANCHE  SHAPER  (SAS)  DEVICES  FOR  HIGH  POWER  APPLICATIONS 


R.  P.  Joshi 
Associate  Professor 

Department  of  Electrical  and  Computer  Engineering 
Old  Dominion  University 
Norfolk,  VA  23529-0246 


Abstract 

Silicon  Avalanche  Shaper  devices  have  been  projected  as  being  important  components  of  an  inexpensive, 
semiconductor-basedall-electricaltechnology  for  high  power  switching  and  pulse  shaping  applications.  The  primary 
advantage  of  this  technology  is  that  it  is  based  on  Silicon  material  which  is  easy  to  fabricate  and  has  a  well 
established  processing  technology.  Unlike  other  high  power  technologies,  the  SAS  devices  do  not  rely  on  external 
optical  triggering  which  eliminates  the  need  for  lasers  and  related  optical  circuitry. 

The  SAS  based  high  power  switching  technology  has  been  pioneered  and  tested  by  a  Russian  group. 
Though  preliminary  results  have  been  very  encouraging,  the  device  reliability  and  its  operating  capability  at  high 
voltages  have  not  been  studied  in  detail.  Furthermore,  the  rather  large  device  parameter  space  has  not  been  fully 
explored  to  ascertain  the  optimal  operating  point.  As  a  result,  the  set  of  device  parameters  necessary  to  yield  the 
desired  electrical  response  for  a  given  application,  remains  unclear.  Besides,  the  potential  for  internal  current 
filamentation  and  device  breakdown  in  the  Silicon  Avalanche  Shaper  has  not  been  analyzed.  All  of  the  these  aspects 
can  best  be  studied  and  understood  in  a  cost-effective  manner  through  numerical  simulations. 

In  this  research,  a  two-dimensional  simulator  for  the  SAS  device  has  been  developed  based  on  the  drift- 
diffusion  model.  Predictions  of  the  device  electrical  response  have  been  obtained.  The  role  of  transverse  variations 
in  the  doping  profile  have  been  analyzed  by  obtaining  the  transient  current  characteristics.  The  results  reveal  the 
development  and  propagation  of  internal  electric  field  waves.  The  wavefront  of  such  propagating  electric  waves  has 
been  shown  to  be  non-uniform,  and  to  depend  on  the  details  of  the  transverse  doping.  The  transverse  variations 
could  either  be  accidental  or  the  result  of  intentional  inhomogeneities  occurring  during  device  fabrication.  Simulation 
data  for  the  conduction  current  distribution  within  the  device  at  various  time  instants,  clearly  show  evidence  for  the 
growth  of  filamentary  modes.  The  filamentation,  however,  is  strongly  dependent  on  the  transverse  doping 
characteristics,  and  hence  could  in  principle  be  tailored. 
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TWO-DIMENSIONAL  SIMULATIONS  OF  ULTRAFAST  SILICON 


AVALANCHE  SHAPER  (SAS)  DEVICES  FOR  HIGH  POWER  APPLICATIONS 


R.  P.  Joshi 


HIGH  POWER  SWITCHING 

High  power  switches  are  the  crucial  components  in  a  variety  of  engineering  circuits.  The  switches  are 
needed  for  a  range  of  applications.  These  include:  (i)  high  voltage  pulse  generators,  (ii)  dc  to  RF  conversion 
circuits,  (iii)  high  energy  pulsed  lasers,  (iv)  for  the  generation  of  ultrawideband  microwaves,  (v)  high  frequency 
plasmatrons,  (vi)  impulse  radar  systems,  and  (vii)  frozen  waveform  generators  [1]. 

The  major  factors  which  need  to  be  considered  in  the  development  of  high  power  switches  include:  (a)  the 
efficiency  of  energy  delivery  from  the  pulser  to  the  load,  (b)  shaping  of  the  output  pulses,  and  (c)  the  control  of  the 
timing  and  its  relative  sequence.  Aspects  related  to  the  efficiency  arise  from  losses  in  the  switch  itself.  The  energy 
transferred  to  the  load  usually  must  be  delivered  as  a  pulse  of  given  rise  time  and  duration.  In  addition,  the  time 
of  delivery  of  the  pulse  frequently  must  be  controlled  accurately.  Also,  since  several  switches  may  be  used  in  one 
network  to  reduce  the  power  dissipated  per  switch  to  a  manageable  level,  close  synchronization  is  often  required  [2]. 

Traditionally,  fast  high-power  switching  was  realized  with  gas  discharge  devices.  These  devices,  however, 
have  a  number  of  inherent  shortcomings,  and  as  a  result,  have  largely  been  replaced  in  the  last  15-20  years  by  high 
power  semiconductors  [1].  The  shortcomings  associated  with  gas  discharge-based  or  mechanical  switching  include: 
(i)  an  inherently  slower  response,  (ii)  large  bulk  and  size,  and  (iii)  lower  thermal  capacity.  The  development  of 
materials  with  non-linear  resistivity  in  commercial  quantities  and  a  greater  range  of  electrical  characteristics,  in 
particular  those  whose  resistance  depends  upon  current  or  temperature,  gave  rise  to  the  utilization  of  solid  state 
devices  as  closing  and/or  opening  switches  in  pulse  power  applications  [3,4].  Such  semiconductor  based  switches 
have  a  number  of  potential  advantages.  These  advantages  include:  (i)  Compactness  and  small  size  leading  to  weight 
reductions,  (ii)  low  jitter  associated  with  optically  triggered  semiconductor  switches,  (iii)  larger  thermal  capacity, 
(iv)  ultrafast  switching  times  and  rapid  repetition  rates,  and  (v)  the  potential  for  a  higher  density  plasma  which  can 
initiate  a  rapid  transition  into  the  high  conducting  state. 

Power  switching  by  semiconductor  devices  of  any  type  occurs  through  a  strong  modulated  increase  in  the 
conductivity.  In  the  "OFF"  state,  the  device  is  in  its  high  resistive  state  and  so  effectively  blocks  any  external 
potential  applied  to  the  device.  This  region  can  either  be  the  space  charge  region  (SCR)  of  a  reverse-biased  junction 
which  is  completely  depleted  of  free  carriers  by  the  strong  external  field,  or  a  highly  resistive  low-doped 
semiconductor  region.  For  schemes  based  on  highly  resistive  low-doped  semiconductors,  the  "OFF"  state  is 
maintained  through  Schottky  contacts  which  provide  an  effective  barrier  to  carrier  injection.  In  the  "ON"  state,  the 
conductivity  of  this  region  increases  dramatically  as  it  becomes  filled  with  an  electron-hole  plasma.  The  electron- 
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hole  plasma  can  be  created  by  any  one  of  the  following  mechanisms: 

(a)  Photo-excitation  by  an  external  optical  source  such  as  a  laser  or  a  light  emitting  diode, 

(b)  Electron-beam  bombardment  to  create  a  plasma, 

(c)  Charge  injection  through  device  contacts  by  applying  a  circuit  voltage,  and 

(d)  By  initiating  an  internal  avalanche  breakdown  process  through  either  band-to-band  or  band-to-trap  impact 
ionization. 

The  use  of  a  solid  state  material  with  the  resistivity  changing  reversibly  by  several  orders  of  magnitude, 
provides  a  better  alternative  to  high  power  switches.  The  most  promising  semiconductor  switches  appear  to  be  those 
based  on  bulk  material  with  carrier  mobility  changing  throughout  the  entire  device  volume.  Such  materials,  in 
principle,  provide  the  highest  volumetric  efficiency  since  they  would  utilize  the  entire  material  volume  uniformly 
on  both  microscopic  and  macroscopic  scales  [2],  The  uniformity  also  works  to  ensure  a  homogeneous  current  density 
and  should  therefore  reduce  the  potential  for  localized  filamentation. 

CHARACTERISTICS  OF  HIGH  POWER  SWITCHES 

In  general,  solid  state  switches  for  high  power  applications  should  exhibit  the  following  desirable 
characteristics: 

•  The  ability  to  withstand  high  electric  fields  and  to  have  a  large  breakdown/blocking  voltage. 

•  Low  leakage  currents  to  preserve  a  high-impedance  "OFF"  state. 

•  Short  turn-on  times  attainable  through  high  mobility  charge  carriers. 

•  A  very  low  resistance  during  the  "ON"  state  and  the  ability  to  sustain  high  current  densities. 

•  Device  reliability  and  the  absence  of  internal  instabilities  such  as  current  filamentation. 

•  Good  thermal  conductivity  for  heat  dissipation. 

•  A  negative  temperature  coefficient  for  suppression  of  thermal  instabilities. 

•  Simple  structure  and  ease  of  fabrication  for  cost  minimization. 

SEMICONDUCTOR  SWITCH  MATERIALS 

The  choice  of  the  semiconducting  material  to  be  used  for  a  high  power  switch  is  very  important  for 
optimizing  the  electrical  characteristics,  its  efficiency  and  overall  utility.  As  already  mentioned,  the  switching  process 
has  to  be  initiated  throughout  the  bulk  of  the  device.  Bulk  generation  has  the  following  advantages:  (i)  Greater 
current  carrying  capability  due  to  the  larger  available  area,  (ii)  Better  uniformity  in  the  current  density  distribution, 
(iii)  easier  and  faster  heat  dissipation  for  improved  reliability  and  controlled  device  parameters,  (iv)  higher  energy 
handling  capacity,  (v)  lower  internal  resistance  leading  to  reduced  internal  voltage  drops  and  lower  "RC"  time 
constants. 

Given  the  importance  of  bulk  switching  and  plasma  propagation  throughout  the  device,  the  choice  of  a 
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material  for  high  power  switching  devices  is  guided  by  the  bulk  semiconductor  properties.  The  important  parameters 
are:  (i)  The  material  band  gap  which  controls  leakage  currents,  places  an  upper  limit  on  the  operating  temperature, 
the  intrinsic  carrier  concentration  and  the  impedance  of  the  "OFF"  state,  (ii)  The  breakdown  field  and  dielectric 
strength  which  determine  the  maximum  hold-off  voltage.  These  parameters  also  provide  a  measure  of  the  device 
length  necessary  to  ensure  the  blockage  of  high  voltages,  (iii)  The  saturated  drift  velocity  which  controls  the 
response  speed  and  the  current  density  levels,  (iv)  The  ionization  coefficient  which  determines  the  threshold  for 
impact  ionization  and  the  possibility  for  internal  carrier  generation  through  avalanching,  (v)  The  nature  of  the  band- 
gap  since,  a  direct  bandgap  semiconductor  provides  the  possibility  of  optical  excitation  while  an  indirect 
semiconductor  does  not.  (vi)  The  thermal  conductivity  which  may  become  a  relevant  consideration  for  energy 
dissipation  and  in  enhancing  the  power  handling  capacity,  (vii)  Ease  of  fabrication,  commercial  development  of 
the  semiconductor  and  relative  costs  of  device  manufacture. 

Based  on  the  above  considerations,  silicon  and  GaAs  emerge  as  the  materials  of  choice  for  high  power 
switches.  Of  these  GaAs  is  a  direct  bandgap  material  and  is  exclusive  to  optically  based  switches.  Silicon,  though 
unsuited  for  photoswitches,  has  a  number  of  advantages  over  GaAs.  For  example,  the  related  processing  technology 
is  very  well  developed  and  the  costs  involved  in  device  manufacture  are  the  least.  It  also  has  a  much  higher  carrier 
drift  velocity  which  ensures  fast  response  times  at  high  fields.  The  thermal  conductivity  is  also  larger  than  GaAs 
which  is  beneficial  for  high  power  switching.  Finally,  both  Ohmic  and  Schottky  contacts  can  be  fabricated  which 
are  robust  and  increase  the  device  reliability.  Furthermore,  Si  unlike  GaAs,  does  not  exhibit  the  intervalley  electron 
transfer  "Gunn  effect"  [5]  which  leads  to  negative  differential  conductivity  (NDC).  NDC,  as  is  well  known,  is 
detrimental  to  reliable  device  and  can  give  rise  to  internal  instabilities.  As  a  result,  this  research  focusses  on  Silicon 
based  high  power  switches. 

The  high  power  switch  studied  here  relies  on  the  creation  of  an  internal  travelling  electric  field  and  the 
impact  ionization  of  electron-hole  pairs  to  trigger  the  device  into  the  conducting  state.  For  an  efficient  transition 
into  the  "ON"  state,  the  associated  ionization  coefficient  "a"  for  the  material  has  to  be  large.  The  "asi"  value 
happens  to  be  larger  than  "a GaAs".  This  occurs  because  the  impact  ionization  threshold  energy  which  depends 
on  the  bandgap  is  lower  for  silicon.  Secondly,  the  disparity  between  electron  and  hole  effective  masses  is  lower  for 
silicon  as  compared  to  GaAs.  As  a  result,  both  electrons  and  holes  can  contribute  to  avalanche  multiplication  in  Si 
more  effectively  than  in  GaAs.  These  considerations  again  favor  silicon  as  the  material  of  choice  over  GaAs. 

RESEARCH  OBJECTIVES 

The  transient  behavior  of  silicon  avalanche  switches  which  are  used  for  high-power  waveform  shaping 
applications,  and  function  in  both  the  "closing"  and  "opening"  modes,  are  studied  here.  Numerical  simulations  have 
been  performed  to  predict  switching  currents  and  examine  the  repetitive  device  response.  The  main  research 
objective  is  to  simulate  two-dimensional  effects  and  thereby  probe  the  possibility  of  current  filamentation  and  the 
prospects  of  internal  non-uniform  current  density  channels.  Such  localized  current  channels  and  filamentation  is 
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known  to  be  destructive  and  needs  to  be  controlled  for  useful  device  performance.  However,  unless  the  two- 
dimensional  effects  are  first  understood  through  numerical  simulations  and  device  modeling,  one  cannot  hope  to 
control  these  effects.  The  simulations  presented  in  this  research  address  this  objective. 

Two-dimensional  effects  in  devices  can  arise  due  to  spatial  nonuniformities.  These  could  either  be  due  to 
non-uniformities  in  the  device  geometry  or  due  to  inhomogeneous  doping  distributions.  Spatially  non-uniform  traps 
and  defects  can  also  contribute  to  two-dimensional  effects.  In  this  research,  only  the  dopant  inhomogeneities  will 
be  addressed.  The  resulting  two-dimensional  current  distributions  which  can  lead  to  filamentation  and  localized 
heating,  will  be  evaluated  for  various  doping  levels  and  profiles. 

COMPARISON  BETWEEN  OPTICAL  AND  ELECTRICAL  TRIGGERING 

This  section  briefly  outlines  the  main  comparisons  between  the  optical  and  electrical  triggering  mechanisms 
for  semiconductor  switches.  The  comparison  provides  reasons  in  support  of  electrically  triggered  silicon  avalanche 
switches  (SAS)  and  hence,  provides  motivation  for  the  present  study. 

(i)  Complex  optical  sources  or  external  lasers  and  photogenerating  elements  are  not  required  for  electrical 
triggering.  This  reduces  the  size  of  the  switching  element  making  it  more  compact. 

(ii)  Electrically  switched  elements  can  easily  be  integrated  with  other  electronic  components  on  a  chip. 

(iii)  The  optical  switches  rely  on  direct  band  gap  semiconductors  such  as  GaAs,  InP  and  other  III-V  materials. 
All  of  these  are  more  expensive  to  fabricate  than  the  indirect  bandgap  silicon. 

(iv)  All  of  the  direct  bandgap  materials  used  for  optical  switches  have  satellite  valleys  which  leads  to  intervalley 
transfer.  This  phenomena  can  give  rise  to  internal  domain  formation  and  can  result  in  internal  instabilities. 

(v)  The  density  of  the  electron-hole  plasma  in  side  optically  excited  switches  usually  is  non-uniform.  This 
arises  from  the  Beer-Lampert  law  which  leads  to  an  exponential  distribution  of  electron-hole  pairs  internally. 
This  inherent  nonuniformity  can  lead  to  instabilities,  localized  current  channels  or  current  filamentation. 

THE  SAS  OPERATIONAL  PRINCIPLE 

The  silicon  avalanche  shaper  switch  (SAS)  was  first  proposed  and  successfully  demonstrated  by  a  Russian 
group  [6-8],  Simple  qualitative  discussions  of  the  SAS  operating  principle  have  been  given  in  several  papers  by 
Grekhov  et  al.  [6-9]  over  the  years.  The  concept  relies  on  the  generation  of  a  fast  propogating  electric  field  and  an 
accompanying  ionization  wave  within  the  device  through  the  application  of  a  rapid  reverse  bias  across  a  Silicon  p-n 
junction  device.  Basically  the  device  has  a  N+-N-P+  structure.  The  low-doped,  near  intrinsic  "N"  region  is  depleted 
of  carriers  during  the  holdoff  phase  and  supports  a  large  voltage  drop.  Initially  the  device  is  able  to  hold  off  an 
applied  voltage  and  a  negligible  current  flows  in  the  circuit.  This  corresponds  to  the  "OFF"  state.  This  applied  hold¬ 
off  bias  is  below  the  critical  breakdown  voltage.  As  soon  as  the  total  applied  peak  bias  exceeds  the  static  breakdown 
voltage,  the  switch  transitions  into  the  "ON"  state  through  the  rapid  avalanche  generation  of  electron-hole  pairs  within 
the  high-field  regions  of  the  device.  The  conductivity  is  therefore  enhanced  rather  quickly,  and  the  switch  triggers 
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into  the  conductive  "ON"  state.  The  switch  is  therefore  triggered  by  an  internal  avalanche  process  through  either 
band-to-band  or  band-to-trap  impact  ionization.  A  brief  summary  of  the  important  sequence  of  physical  events  is 
given  below. 

(i)  The  SAS  is  initially  in  the  non-conducting  "OFF”  state  with  a  moderate  reverse  bias  across  the  N+-N-P+ 
structure.  The  applied  voltage  is  held  to  a  value  below  the  characteristic  avalanche  voltage  of  the  Si  device. 
This  hold-off  voltage  can,  in  principle,  be  varied  by  changing  either  the  internal  doping  densities  or  the 
device  length.  Lowering  the  doping  for  example,  increases  the  internal  electric  field,  making  it  possible  for 
the  device  to  withstand  an  initiation  into  the  avalanche  mode.  Similarly,  increasing  the  device  length  also 
increases  the  holdoff  voltage  capability. 

(H)  For  turn-on,  a  sharply  reverse  voltage  pulse  is  applied.  As  a  result,  the  total  voltage  across  the  SAS  device 
then  exceeds  the  breakdown  limit.  At  the  higher  voltage,  a  high-  field  region  is  created  across  the  P+-N 
junction,  with  field  strengths  exceeding  the  critical  breakdown  value.  Impact  ionization  is  thus  initiated  in 
a  region  localized  near  the  P+-N  junction. 

(iii)  Rapid  impact  ionization  creates  a  high  conductivity  region  at  the  P+-N  junction.  The  electrons  and  holes 
thus  created  through  impact  ionization,  begin  to  move  in  opposite  directions.  The  motion  of  the  mobile 
charge  contributes  to  a  current  rise. 

(iv)  Next,  the  electric  field  in  the  highly  conduction  region  begin  to  collapse  as  the  separation  of  mobile  charge 
forms  an  internal  polarization  field.  The  electric  field  in  the  adjacent  non-conducting  N-region,  however, 
increases  to  sustain  the  overall  voltage  drop  across  the  device.  This  increase  in  the  electric  field  provides 
the  displacement  current  necessary  to  maintain  exact  continuity  of  the  total  current  across  the  entire  device. 

(v)  As  a  result  of  the  above  mechanism,  the  electric  field  grows  starting  from  the  P+-N  junction,  towards  the 
N+  anode  side  through  the  N-region  in  the  middle.  A  propagating  electric  field  wave  is  thus  set  up.  This 
propagating  field,  in  turns,  creates  a  travelling  impact  ionization  front,  and  is  responsible  for  the  uniform 
turn-on  of  the  entire  device. 

(vi)  The  current  in  the  SAS  device  can  be  turned  off  by  lowering  the  applied  bias  to  a  level  below  the  critical 
sustaining  breakdown  voltage.  Upon  lowering  the  external  voltage,  the  carriers  are  removed  through  a 
combination  of  drift  flush-out  and  recombination. 

A  schematic  for  the  turn-on  sequence  is  shown  in  Figure  1.  The  initial  field  in  the  N-P+  depletion  region 
is  below  the  critical  value  for  impact  ionization.  Hence,  the  carrier  concentration  and  the  conductivity  remain  at  a 
low  level.  On  the  application  of  a  large  reverse  bias,  the  field  increases  beyond  the  critical  value,  and  carrier 
generation  via  impact  ionization  is  initiated.  This  ionization  effectively  increases  the  local  conductivity,  and  forces 
the  growth  of  the  electric  field  in  an  adjoining  low  conductivity  region.  This  progressive  movement  of  the  electric 
field  away  from  the  junction  and  into  the  low  conductivity  N-region  continues,  as  carriers  continually  keep  getting 
generated  within  the  N-layer  starting  from  the  junction. 
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Fig.  1  Schematic  of  the  SAS  tum-on  sequence. 

ADVANTAGES  AND  BENEFITS  OF  THE  SAS  DEVICE 

Based  on  the  above  summary  of  the  SAS  operational  principle,  several  key  advantages  can  be  identified. 

These  are  based  on  the  inherent  operating  principle  and  the  device  physics.  The  following  is  a  list  of  important 

advantages  for  this  SAS-based  high  power  technology,  which  has  also  been  discussed  previously  [10]: 

(i)  The  elimination  of  external  lasers  for  triggering  and  turn-on.  This  simplifies  the  complexity  of  the  circuit. 

(ii)  A  natural  self-propagation  of  the  electric  fields  within  the  device.  This  facilitates  a  gradual  and  more 
homogeneous  tum-on.  The  potential  for  instabilities  associated  with  spatial  non-uniformities  as  with 
localized  charge  injection,  is  thus  greatly  reduced. 

(iii)  The  application  of  a  rapidly  increasing  voltage  pulse  dramatically  decreases  the  likelihood  of  current 
filamentation  and  "micro-channels".  As  is  well  known  [11],  such  filamentation  is  associated  with  the  rapid 
growth  of  the  electric  fields  within  localized  regions  of  the  device  due  to  density  fluctuations.  However, 
in  this  device,  the  application  of  a  fast  voltage  ramp  increases  the  electric  fields  everywhere  within  the 
device.  As  a  result,  currents  cannot  remain  localized  within  narrow  channels  since  the  rapidly  increasing 
field  creates  charge  in  the  adjacent  regions  through  impact  ionization.  The  carrier  density  is  therefore 
enhanced  over  a  broad  area,  thereby  swamping  the  initial  fluctuations  (if  any)  in  the  local  carrier  density. 

(iv)  Another  advantage  of  this  technology  is  its  reliance  on  Si  rather  than  GaAs.  Silicon  has  the  following 
inherent  advantages  over  GaAs,  a  material  which  is  often  used  in  photoconducting  switches.  The  advantages 
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include:  (a)  Ease  of  fabrication,  (b)  A  higher  thermal  conductivity  for  better  heat  dissipation  and  better 
protection  against  thermal  runaway,  (c)  A  higher  carrier  saturation  velocity  which  should  lead  to  higher 
currents  and  faster  turn-on.  (d)  A  material  system  inherently  free  of  Negative  Differential  Resistance  (NDR) 
effect.  As  is  well  known,  the  NDR  effect  in  GaAs  associated  with  inter-valley  transfer  gives  rise  to  a 
potential  instability  [12]. 

(v)  Furthermore,  the  S  AS  by  being  a  p-n  junction  device,  naturally  ensures  that  the  regions  of  high  electric  field 
occur  within  the  bulk  device  and  not  at  the  end  contacts.  As  a  result,  the  device  circumvents  many  of  the 
"contact  problems"  that  are  associated  with  other  devices.  These  problems  include  uncontrolled  thermionic 
emission  and  carrier  tunneling  due  to  high  fields  at  the  contact,  excessive  heating  and  contact  degradation. 

(vi)  Finally,  the  SAS  also  offers  the  possibility  of  scaling  the  hold-off  voltage.  This  could  be  done  by  stacking 
several  such  devices  in  series,  increasing  the  size  of  the  central  N-region,  and  by  decreasing  the  doping 
levels.  Therefore  there  do  not  appear  to  be  any  fundamental  limits  on  the  voltage  scaling. 

ASPECTS  OF  THE  TWO-DIMENSIONAL  RESEARCH  SIMULATIONS 

There  are  basically  two  important  aspects  that  are  addressed  in  this  research.  These  two  areas  are:  (1)  The 
issue  of  current  filamentation,  and  (2)  Determining  the  role  of  two-dimensional  variations  in  the  doping  density  on 
the  electrical  response  and  device  stability.  Both  of  these  aspects  are  important  in  determining  the  reliability  and 
usefulness  of  the  SAS  device.  For  instance,  if  the  doping  is  not  homogeneous  across  all  of  the  cross-sections  of  the 
device,  the  internal  conductivity  will  also  be  nonuniform.  The  nonuniform  conductivity  can  then  lead  to  the 
formation  of  localized  current  channels  with  the  possibility  of  premature  switch  turn  on. 

As  is  well  known,  current  filamentation  arises  from  spatial  non-uniformities  across  the  device  cross  section, 
and  is  basically  a  two-dimensional  effect.  Such  non-uniformities  can  arise  from  any  one  of  the  following  reasons: 
(a)  Doping  inhomogeneities  (either  intentional  or  accidental)  during  the  processing  and  fabrication  steps,  (b)  The 
formation  of  spatially  non-uniform  defects  within  the  semiconductor,  (c)  The  development  of  non-uniform  fields 
within  the  semiconductor  during  device  operation  which  can  lead  to  non-uniform  impact  ionization  with  variable 
charge  creation,  (d)  Thermal  effects  which  again  can  lead  to  non-uniform  carrier  generation  within  the  device. 

In  order  to  explore  these  issues,  a  two-dimensional  numerical  model  has  been  developed.  The  model  uses 
the  drift-diffusion  approach,  given  its  relative  simplicity  and  modest  computational  requirements.  Furthermore,  since 
the  dimensions  of  the  SAS  devices  used  in  this  study  are  well  above  the  micrometer  range  and  the  time  scale  of 
interest  are  relatively  large  (higher  than  10  ns),  the  more  complex  approaches  such  as  the  Monte  Carlo  method  will 
not  be  necessary.  The  results  obtained  from  the  drift-diffusion  scheme  are  therefore  expected  to  be  adequate. 

THE  DRIFT-DIFFUSION  TRANSPORT  APPROACH 

The  overall  current  transport  in  the  Drift-Diffusion  approach  is  described  by  the  following  equation: 
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(1) 


J=nqvn  +pgv  -qDA  +qD  A +e^ 

ax  y  ax  dt 

where  E  is  the  electric  field,  e  is  the  permitivity  of  material,  "n"  and  "p"  are  the  electron  and  hole  densities,  and  Dnp 

the  electron/hole  diffusion  coefficients.  Both  the  conduction  and  displacement  currents  are  included  in  this 

formulation.  This  equation  is  derived  from  the  Boltzmann  transport  equation  under  the  following  assumptions: 

a)  An  isotropic  and  parabolic  single  band. 

b)  Inherent  assumption  of  the  relaxation  time  approximation. 

c)  Absence  of  degeneracy  with  no  hot  carrier  effects. 

d)  A  homogeneous  collision  time.  The  validity  of  the  drift-diffusion  scheme  also  requires  that  the  scattering 

time  be  significantly  smaller  than  the  simulation  time  scale  of  interest. 

e)  No  strong  off-equilibrium  effects,  and 

f)  An  assumption  that  the  length  scales  for  spatial  variation  of  the  electric  field  and  impurity 

concentrations  are  much  longer  than  the  carrier  mean  free  path. 

The  model  developed  for  the  study  of  the  SAS  devices  had  the  following  salient  features: 

•  It  was  based  on  the  drift-diffusion  approach. 

•  It  explicitly  solved  for  the  two-dimensional  distributions  of  the  internal  electric  fields  and  carrier 
densities. 

•  It  yielded  the  self-consistent  transient  analysis,  as  well  as  the  final  steady-state  response. 

•  The  model  was  bipolar  in  nature,  and  could  treat  both  electron  and  hole  transport. 

•  An  important  component  of  the  model  consists  of  rate  equations  which  describe  the  transient 
dynamics  of  the  free-carrier  generation,  trapping  and  recombination. 

•  An  external  circuit  was  incorporated  in  the  model. 

•  The  model  took  account  of  the  internal  displacement  current. 

•  An  added  feature  of  the  model  was  the  inclusion  of  impact  ionization  at  high  electric  fields.  This 
mechanism  is  very  important  since  it  is  at  the  heart  of  the  switch  "tum-on"  process. 

•  The  numerical  implementation  included  heat  generation  within  the  device.  A  one-dimensional 
diffusive  heat  flow  model  is  included,  and  changes  in  the  transport  parameters  with  any  resulting 
temperature  rise  taken  into  account. 

•  Finally,  carrier  injection  at  the  two  contacts  was  implemented  based  on  the  thermionic  emission 
model  which  uses  the  Richardson  coefficient  A'.  A  value  of  80  A/cm2/K2  was  assumed  for  A*. 
The  barrier  heights  at  each  contact  were  assumed  fixed  at  a  value  of  0.5  eV  independent  of  the 
current  density. 
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SEMICONDUCTOR  MODEL  IMPLEMENTATION  AND  SIMULATION  SCHEME 

For  numerical  implementation,  the  SAS  device  was  spatially  divided  into  "N"  boxes  each  with  a  finite  spatial 
extent.  The  mesh  sizes  Ax  along  the  longitudinal  x-direction,  and  Ay  along  the  transverse  y-direction,  were  chosen 
to  be  equal.  The  density  of  free  carriers,  ionized/unionized  donors  and  acceptors,  the  average  internal  electric  field, 
and  device  temperature  were  all  represented  by  discrete  values  at  the  center  of  each  box.  Thus  the  above  parameters 
were  assumed  to  be  fixed  within  each  box,  but  to  vary  from  box  to  simulation  box  during  each  time  step  of  the 
simulation.  A  schematic  of  the  device  simulation  region  along  with  the  circuit  is  shown  in  Fig.  2. 
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Fig.  2  Schematic  of  the  circuit  and  the  device  simulation  region. 

The  temporal  scale  was  also  discretized  into  time  intervals  of  magnitude  At.  This  time  step  At  was  chosen 
to  satisfy  the  requirement  of  being  greater  than  both  the  relaxation  time  and  the  internal  collision  times.  In  order 
to  obtain  numerical  stability  of  the  space-time  discretization,  the  well  known  "Courant-Fredericks-Lewy"  inequality 
condition  [13]  was  used  to  limit  the  time  step  At.  Thus:  At  <  Ax/[2  vj.  The  specific  silicon  SAS  device  structure 
used  for  the  simulations  was  a  N+-N-P*  diode  with  dimensions  of  40pm,  250pm  and  50pm,  respectively.  A  total 
of  150  discretized  boxes  were  chosen  along  the  longitudinal  x-direction  for  the  device  simulations. 

RESULTS  AND  DISCUSSION 

This  section  provides  all  the  numerical  simulation  results  that  were  obtained  for  the  SAS  device.  The 
simulations  were  carried  out  to  predict  and  gauge  the  electrical  response  characteristics.  The  2D  implementation  of 
the  drift-diffusion  model  was  used  to  obtain  the  transient  response  for  four  different  doping  profiles  along  the 
transverse  direction.  These  four  doping  profiles  chosen  for  the  simulations  were  as:  (i)  A  sinusoidal  shaped  centro- 
symmetric  distribution  with  a  minimum  dopant  density  at  the  two  opposite  surfaces  and  a  maxima  over  the  axial 
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center,  (ii)  A  sinusoidal  shaped  centro-symmetric  distribution  with  maximum  dopant  density  at  the  two  opposite 
surfaces  and  a  minima  over  the  axial  center,  (iii)  A  two-layer  asymmetric,  off-center  composite  distribution,  and 
(iv)  Uniform  doping  profiles  along  the  transverse  direction.  Three-dimensional  plots  showing  details  of  the  dopant 
distribution  profiles  for  each  case  are  shown  next  along  with  a  discussion  of  the  results  obtained  for  each  profile. 

TWO-DIMENSIONAL  RESULTS  AND  ANALYSIS:  CASE  I 

The  SAS  device  was  taken  to  comprise  of  N+-N-P+  Si  material  with  nonuniform  doping  profiles  along  the 
transverse  direction  for  each  of  the  three  regions.  The  same  functional  form  was  used  for  the  doping  profiles  in  the 
three  constituent  regions.  The  peak  dopant  density  values  were  taken  to  be:  NDm“  =  5  x  1016  cm'3  in  the  N+  region, 
NDmax  =  1014  cm'3  for  the  lower  doped  N-layer,  and  NAm“  =  5  x  1016  cm'3  for  the  P+  region.  The  external  circuit 
was  as  shown  in  Figure  2.  The  device  area  was  2  x  10'2  cm2  and  the  operating  temperature  was  assumed  to  be  300 
K.  For  simplicity,  a  periodic  "saw-tooth"  type  voltage  waveform  as  shown  in  Fig.  3  was  assumed  for  the  voltage 
source.  Such  a  waveshape  is  similar  to  that  actually  used  in  preliminary  experimental  tests  so  far  [14]. 

Simulation  results  for  the  initial  unbiased  steady  state  conditions  within  the  SAS  device  for  the  first  of  the 
four  doping  profiles  are  shown  in  Figures  4-8.  The  dopant  profile  for  this  case  was  taken  to  be  a  sinusoidally 
symmetric  distribution  with  minimum  dopant  density  at  the  two  opposite  surfaces  and  a  maxima  on  the  central  axis. 
The  resulting  ionized  impurity  concentration  as  obtained  from  the  simulations  is  shown  in  Figure  4.  The  peak 
magnitude  of  the  ionized  impurity  density  nearly  equals  5  x  1016  cm'3  which  is  the  maximum  concentration  of  dopant 
atoms.  This  peak  lies  on  the  central  axis  of  the  device  and  the  profile  is  symmetric.  The  2D  electron  and  hole 
profiles  are  plotted  in  Figures  5  and  6,  respectively.  Their  transverse  shapes  follow  that  of  the  dopant  distribution. 
In  addition,  some  structure  along  the  longitudinal  direction  is  also  seen  at  the  junctions.  This  is  due  to  free  carrier 
diffusion  and  the  formation  of  built-in  electric  fields  and  depletion  regions  at  the  various  junctions  and  interfaces. 
The  longitudinal  electric  field  distribution  Ex  under  unbiased  steady  state  conditions  is  shown  in  Figure  7  and  the 
transverse  field  Ey  in  Figure  8.  A  number  of  features  associated  with  this  plot  are  immediately  evident.  First,  the 
magnitude  of  Ey  is  much  smaller  than  that  of  Ex.  This  is  to  be  expected  since  the  steady  state  electric  field  is  roughly 
proportional  to  the  negative  gradient  of  the  majority  carrier  density  "n(x,y)".  This  follows  from  the  requirement  of 
a  zero  net  conduction  current,  and  leads  to  Ey  being  proportional  to:  -[dn/dy].  Since  the  gradient  "dn/dy"  is 
antisymmetric  for  the  chosen  doping  profile,  the  electric  field  distribution  Ey  also  has  an  anti-symmetric  shape.  Next, 
since  the  gradient  of  the  majority  carrier  density  and  dopant  distribution  along  the  transverse  "y"  direction  is  not  as 
large  as  that  along  the  x-direction,  the  magnitude  of  Ey  is  smaller  than  Ex.  Finally,  since  electrons  and  holes  are 
oppositely  charged,  their  diffusion  along  the  transverse  direction  gives  rise  to  built-in  fields  Ey  which  are  mutually 
anti-parallel.  This  is  evident  from  Figure  8  as  the  Ey  direction  within  the  P+  region  on  the  left  is  opposite  that  of 
the  field  direction  in  the  N  and  N+  regions. 

The  transient  current  corresponding  to  this  voltage  waveform  of  Fig.  3  is  shown  in  Figure  9.  The  initial 
spike  represents  a  displacement  current  created  in  response  to  the  voltage  ramp  from  0  Volts  to  600  Volts.  Following 
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Fig.  3  Applied  voltage  waveform 


Steady  state  ionized  impurity  profile  for  unbiased  case  I  device 


Fig.  5  Steady  state  electron  distribution  for  unbiased  case  I  device 
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the  initial  build  up  of  the  internal  fields  and  carrier  densities  within  the  device,  a  steady  OFF  state  situation  is 
reached  and  the  current  drops  to  near-zero  levels.  The  current  remains  at  this  near-zero  level  until  the  application 
of  the  ionizing  voltage  ramp  at  15  ns.  The  current  then  increases  to  about  0.5  Ampere,  and  is  comprised  of  both 
the  displacement  and  conduction  components.  Of  these  two,  the  displacement  component  gradually  weakens  as  a 
steady  state  electric  field  distribution  begins  to  be  set  up  within  the  device.  This  gives  rise  to  a  slight  decrease  in 
the  current  from  the  0.75  Ampere  level  to  about  0.65  Ampere.  At  around  18  ns  in  time,  the  current  is  seen  to  begin 
increasing  dramatically  from  its  value  of  about  0.65  Ampere  to  a  maximum  of  about  2.2  Amperes.  This  maximum 
occurs  at  20  ns,  and  corresponds  to  the  time  at  which  the  applied  voltage  is  a  maximum  of  1600  Volts.  The 
increased  current  is  due  to  the  initiation  of  internal  impact  ionization  and  the  process  of  carrier  avalanche. 

Following  a  decay  in  the  applied  voltage,  the  current  begins  to  decrease  as  expected.  The  current  falls  below 
zero  at  about  21  ns,  and  assumes  a  slight  undershoot  before  settling  to  a  near-zero  level  at  around  32  ns.  The 
negative  current  signifies  that  the  SAS  device  is  functioning  as  an  active  element  and  supplying  power  to  both  the 
resistor  and  the  voltage  element.  Physically,  the  source  of  this  power  supply  comes  from  the  stored  electromagnetic 
energy  within  the  SAS  device  which  had  been  built  up  during  the  applied  voltage  ramp.  The  undershoot 
characteristic  and  its  duration  is  expected  to  depend  on  the  actual  circuit  conditions.  For  instance,  a  higher  value 
of  the  circuit  resistance  would  lead  to  a  quicker  dissipation  of  the  energy  stored  within  the  SAS  device,  and  hence 
a  less  significant  undershoot.  Similarly,  the  presence  of  circuit  inductances  would  also  weaken  the  undershoot 
behavior. 

The  second  current  pulse  appears  to  have  a  slightly  lower  peak  than  the  first.  This  marginal  variation 
between  the  first  and  second  current  pulses  seems  to  suggest  that  a  "short-term  memory  effect"  is  inherent  in  the  SAS 
device.  Such  a  memory  effect  is  probably  associated  with  the  transverse  propagation  of  both  the  electric  field  and 
the  free  carrier  densities  within  the  device  during  the  turn-on  cycle.  The  movement  of  carriers  along  the  transverse 
direction  distort  the  state  of  the  semiconductor  system.  During  the  "turn-off'  transient,  changes  in  the  electric  fields 
are  predominantly  along  the  longitudinal  direction  since  this  is  the  orientation  of  the  applied  voltage.  The  result  is 
a  relatively  fast  redistribution  parallel  to  the  longitudinal  axis  leading  to  near  complete  recovery  of  the  semiconductor 
system.  However,  the  recovery  and  charge  redistribution  in  the  transverse  direction  is  not  as  rapid.  This  occurs  for 
two  reasons.  First,  the  electric  field  is  changing  at  a  very  slow  rate  as  there  is  no  applied  voltage  in  this  direction. 
Second,  since  the  values  of  the  transverse  field  Ey  are  much  smaller  than  those  of  Ex,  the  driving  force  for  carrier 
movement  and  relocation  is  weak.  Consequently,  the  inherent  "relaxation  time"  or  the  duration  necessary  for 
bringing  the  semiconductor  system  back  into  the  initial  "OFF  state"  is  quite  large.  From  the  simulation  results,  it 
appears  that  this  time  is  more  than  25  ns,  since  the  SAS  does  not  appear  to  have  fully  recovered  at  the  start  of  the 
second  pulse. 

Results  pertaining  to  the  internal  electric  field  are  presented  next.  The  longitudinal  components  of  the 
electric  field  at  10  ns,  18  ns  and  20  ns  are  presented  in  Figures  10-12.  From  these  plots  it  is  obvious  that  the  internal 
field  grows  and  propagates  from  the  P+-N  junction  on  applying  the  biasing  pulse.  A  travelling  electromagnetic  wave 
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is  thus  evident.  From  Figure  12,  this  Ex  wave  is  seen  to  develop  a  distinct  hump  near  the  center  of  the  device  at 
20  ns.  As  a  result  of  this  hump,  the  field  magnitude  over  the  central  portion  of  the  P+-N  junction  boundary  in  Figure 
12  is  not  as  high  as  at  the  two  ends.  This  is  in  keeping  with  the  requirement  that  the  line  integral  of  the  longitudinal 
field  Ex  have  a  constant  value  across  all  paths  parallel  to  the  longitudinal  axis.  Since  the  electric  field  increases  at 
the  central  portion  of  the  device  over  the  central  axis,  its  value  on  the  central  axis  at  the  P+-N  junction  is  relatively 
lowered  to  maintain  the  path  independent  requirement. 

The  snapshot  of  the  transverse  field  component  at  20  ns  has  been  shown  in  Figures  13.  The  values  are  much 
smaller  than  those  of  Ex  at  the  corresponding  times  as  may  be  expected.  The  temporal  development  of  Ey  reveals 
the  formation  and  movement  of  an  electron  wave,  and  can  be  understood  based  on  the  following  physical  sequence 
of  events.  Application  of  the  initial  voltage  ramp  from  0  Volts  to  600  Volts  widens  the  depletion  region  as  compared 
to  the  unbiased  steady  state.  Electrons  present  on  the  N-side  of  the  P+-N  junction  move  away  from  the  junction  and 
into  the  N  region.  These  carriers  not  only  have  a  longitudinal  motion  into  the  N-region,  but  are  also  diffuse  laterally 
away  from  the  center.  However,  the  net  transverse  current  has  to  equal  zero  at  all  times  since  there  is  no  conductive 
path  in  this  direction.  This  forces  a  transverse  electric  field  Ey  to  be  built  up  for  establishing  a  dynamic  equilibrium 
with  zero  net  current.  The  result  is  the  formation  of  anti-symmetric  peaks  in  Fig.  13  within  the  N-region,  at 
locations  away  from  the  junction  boundary.  Since  Ey  is  proportional  to  the  negative  gradient  of  the  free  carrier 
density,  the  profile  is  anti-symmetric  relative  to  the  central  axis. 

Figure  1 3  also  reveals  secondary  bumps  behind  the  anti-symmetric  peaks.  The  secondary  bumps  are  closer 
to  the  junction  boundary  and  are  caused  by  the  injection  of  electrons  into  the  N-region  from  the  P+-N  boundary. 
The  bumps  indicate  that  the  transverse  distribution  of  this  wave  of  injected  electrons  is  once  again  non-uniform 
leading  to  transverse  diffusion.  The  bumps  show  that  the  electron  injection  responsible  for  creating  the  secondary 
Ey  profile  arises  from  impact  ionization.  This  is  based  on  the  following  argument.  The  P+-N  junction  prior  to  the 
start  of  the  1600  Volt  voltage  ramp  is  already  in  a  state  of  depletion  as  a  result  of  the  600  Volt  reverse  biasing 
voltage.  It  is  thus  incapable  of  supplying  any  free  carriers.  Hence,  electronic  injection  can  only  arise  from 
avalanching  and  impact  ionization  across  the  junction  boundary. 

The  current  distributions  within  the  SAS  device  are  shown  in  Figures  14  and  15.  The  longitudinal 
conduction  current  at  18  ns  shown  in  Figure  14  is  relatively  large  throughout  the  N+  and  the  P+  regions.  This  is 
expected  since  these  regions  have  high  carrier  densities  leading  to  large  drift  current  contributions.  The  conduction 
current  within  the  middle  N-region,  however,  is  negligible  by  comparison.  Figure  15  at  20  ns  reveals  a  general 
increase  in  the  current.  Within  the  N  region,  the  conduction  current  increase  is  dramatic  over  the  central  axis.  This 
can  be  attributed  to  three  factors.  The  first  arises  from  the  movement  of  the  existing  mobile  electrons  within  the 
N-region  upon  the  application  of  the  reversed  biasing  ramp.  As  these  carriers  move  away,  the  depletion  region 
begins  to  expand.  Since  the  central  region  over  the  axis  has  the  highest  doping  and  hence  the  largest  supply  of 
mobile  carriers,  the  conduction  current  is  the  largest  over  this  section.  The  second  factor  arises  from  electron 
injection  from  the  P+  side  into  the  N-region.  This  mainly  occurs  over  the  central  axial  portion  which  has  the  highest 
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supply  of  carriers  to  initiate  impact  ionization.  The  double  humped  structure  on  the  axis  is  indicative  of  the  finite 
time  lag  between  the  initial  movement  of  electrons  within  the  N-side  as  the  depletion  region  increases,  and  the 
subsequent  injection  from  the  P+-N  junction  due  to  impact  ionization.  Thirdly,  the  strong  increase  in  the  longitudinal 
electric  field  Ex  within  the  N-region,  relative  to  the  N+  and  P+  regions  also  contributes  to  the  high  conduction  current. 
This  occurs  because  the  drift  velocity  is  appreciably  enhanced  as  a  result  of  the  electric  field  increase  which  promotes 
a  high  drift  current  value.  Figure  15  also  reveals  the  presence  of  a  highly  non-uniform  current  distribution  along 
the  transverse  dimension,  and  is  indicative  of  a  "filamentary  mode".  The  central  location  of  this  current  filament 
can  be  potentially  detrimental,  since  it  can  lead  to  internal  heating.  Heat  produced  within  the  filament  would  be 
more  difficult  to  dissipate  through  external  cooling,  given  its  central  axial  location.  Had  the  filament  been  towards 
the  outer  boundaries,  heat  dissipation  through  external  cooling  schemes  would  be  easier.  The  heating  can  potentially 
lead  to  a  second  breakdown  and  irreversible  device  failure.  These  results  therefore  indicate  that  SAS  devices  with 
large  doping  towards  the  central  axis  cannot  be  left  in  the  "ON-state"  for  a  long  duration. 

The  total  longitudinal  current  Ix  as  a  function  of  the  transverse  direction  at  different  times  is  shown  in 
Figures  1 6  and  17.  The  plots  show,  as  expected,  that  the  current  progressively  increases  as  the  biasing  pulse  is  turned 
on.  Furthermore,  the  longitudinal  currents  are  all  equal  and  there  is  no  transverse  variation.  This  result  provides 
a  consistency  check  for  the  2D  simulations  employed  here,  since  one  expects  the  gradient  of  the  total  current  to  be 
zero  based  on  the  following  argument.  Basically,  the  total  current  has  to  be  conserved,  J10'  =  Jx“l  +  Jy“‘  =  constant. 
However,  J*'  =  0  since  there  is  no  conduction  path  along  the  transverse  y-direction.  Since  J10'  is  a  constant,  the  curl 
of  J“'  has  to  be  zero.  But  the  curl  of  J*01  equals:  a,  [dJy“7dx  -  dJxtot/dy]  =  az  [0  -  dJxt0Vdy]  =  -  az  dJx“‘/dy.  As  a 
result,  dJx*°Vdy  =  0  as  borne  out  in  the  results  of  Figures  16-17. 


TWO-DIMENSIONAL  RESULTS  AND  ANALYSIS:  CASE  II 

Results  for  the  second  doping  profile  are  examined  next.  This  test  case  is  the  inverse  of  the  previous  profile, 
with  the  dopant  density  being  centro-symmetric  with  a  minimum  level  at  the  central  axis  and  the  maximum  at  the 
two  end  surfaces.  Before  discussing  the  results  obtained,  it  is  instructive  to  look  at  some  of  the  obvious  differences. 
First,  we  compare  the  average  conductive  ratio  between  the  two  cases  by  evaluating  the  dopant  charge.  Since  the 
conductance  "G"  is  proportional  to  the  carrier  density,  a  ratio  of  the  conductances  G,/Gn  =  p,‘7p1av.  For  the  dopant 
distributions  chosen  here,  this  ratio  works  out  to  about  1.2  as  given  in  equation  (2).  Designating  LT  as  the  transverse 
length  along  the  "y"  axis,  the  conductivity  ratio  is  given  as: 
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Fig.  16  Total  current  in  the  device  at  18  ns  as  a  function  of  the  transverse  axis 


Number  of  Meshes  in  Y-Direction 

Fig.  17  Total  current  in  the  device  at  20  ns  as  a  function  of  the  transverse  axis 
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Hence,  the  average  conductivity  for  case  I  is  larger  than  that  of  case  II.  It  is  therefore  follows  that  the  circuit 
currents  can  be  expected  to  be  higher  in  case  I  as  compared  to  case  II.  A  second  point  pertains  to  anticipated  relative 
values  of  the  peak  electric  field  within  the  device.  Based  on  simple  P-N  junction  theory,  the  maximum  internal 
electric  field  Emax  at  any  voltage  is  a  monotonically  increasing  function  of  the  average  doping  density  "Nav".  Thus: 
Emax  a  [Nav]° 5  .  This  implies  that  the  peak  electric  field  for  case  I  would  be  higher  than  those  for  case  II.  As  a 
result,  impact  ionization  and  the  launching  of  an  electron  wave  is  expected  to  be  relatively  strong  for  case  I  and  not 
as  dominant  for  the  case  II  device. 

Simulation  result  for  the  initial  unbiased  steady  state  ionized  density  distribution  within  the  SAS  device  is 
shown  in  Figure  1 8.  This  profile  is  in  keeping  with  the  chosen  centro-symmetric  distribution.  The  peak  magnitude 
of  the  ionized  impurity  density  nearly  equals  5  x  1016  cm'3  which  is  the  maximum  concentration  of  dopant  atoms, 
and  occurs  at  the  two  edges.  The  transient  current  corresponding  to  the  voltage  waveform  of  Fig.  3  is  shown  in 
Figure  19.  This  plot  differs  from  the  corresponding  results  for  case  I  in  two  respects,  (i)  First,  the  value  of  the 
circuit  currents  for  case  II  are  generally  lower  than  those  for  case  I.  For  instance,  the  initial  displacement  current 
spike  is  about  5.2  Amperes  in  case  I  but  only  3.6  Amperes  in  case  II.  The  maximum  currents  during  the  applied 
voltage  pulse  at  about  20  ns  and  46  ns  are  also  smaller  for  case  II.  These  decreased  current  magnitudes  are  the  result 
of  a  lower  average  conductance  Gn  for  case  II  as  compared  with  case  I.  Such  a  result  is  expected  and  was  briefly 
discussed  above  through  equation  (2).  (ii)  Next,  the  sharp  current  pulses  produced  around  20  ns  and  46  ns  during 
the  voltage  ramps  in  case  I  are  absent  in  case  II.  Instead,  a  broad  current  pulse  having  a  5  ns  time  duration  results 
in  case  II.  The  production  of  such  a  broad  pulse  is  not  very  useful  for  pulse  shaping  applications.  The  absence  of 
a  sharp  current  peak  in  case  II  suggests  a  lack  of  strong  impact  ionization  and  internal  avalanching.  This  result  is 
a  direct  consequence  of  a  lower  average  device  doping  which  leads  to  lower  values  of  the  maximum  electric  field 
Emax  as  has  been  discussed  above.  These  lower  field  values  preclude  the  onset  of  strong  avalanching,  and  the 
launching  of  a  rapid  ionization  wave.  Given  this  lack  of  strong  internal  ionization,  the  current  waveform  only 
displays  an  initial  displacement  current  rise  at  around  15  ns  and  40  ns  as  the  voltage  ramp  is  applied.  This  is 
followed  by  a  monotonic  decrease  as  the  voltage  drop  across  the  50  Ohm  series  resistor  increases. 

Results  of  the  internal  electric  field  Ex  at  10  ns  and  20  ns  are  presented  in  Figures  20  and  21.  These  plots 
reveal  growth  and  propagation  of  an  internal  field.  This  growth  and  propagation  occurs  mainly  towards  the  outer 
edges.  Correspondingly,  the  values  of  the  electric  field  over  the  outer  portions  at  the  P+-N  junction  boundary  in 
Figure  21  are  not  as  high  as  those  over  the  center.  This  is  in  keeping  with  the  requirement  that  the  line  integral  of 
the  longitudinal  field  Ex  have  a  constant  value  across  all  paths  parallel  to  the  longitudinal  axis.  The  primary 
difference  between  Fig.  2 1  and  the  case  I  plot  of  Fig.  12  lies  in  the  magnitudes  of  the  electric  field.  The  peak  values 
in  Fig.  21  are  at  about  1.7  x  107  V/m,  while  those  of  Fig.  12  are  at  1.85  x  107  V/m.  The  lower  values  of  the 
maximum  field  work  to  dramatically  reduce  the  impact  ionization  since  the  process  has  an  exponential  dependence 
on  the  electric  field.  As  a  result,  ionization  and  avalanching  in  the  device  of  case  II  is  not  as  strong,  and  does  not 
give  rise  to  a  sharp  current  amplification. 
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Fie.  18  Steady  state  ionized  impurity  profile  for  case  II  unbiased  device 
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Fig.  19  Simulation  results  for  the  transient  circuit  current  waveform 
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Fig.  20  Longitudinal  electric  field  distribution  at  10  ns 


Fig.  21  Longitudinal  electric  field  distribution  at  20  ns 


Snapshot  of  the  longitudinal  conduction  current  at  20  ns  within  the  SAS  device  is  shown  in  Figures  22. 
The  plot  shows  that  the  conduction  current  is  relatively  large  throughout  the  N+  and  the  P+  regions  as  expected  due 
to  the  large  supply  of  free  cariers.  However,  in  the  middle  N-region,  the  current  is  almost  negligible.  Furthermore, 
no  filamentary  growth  or  localized  current  flow  enhancements  are  evident.  This  result  is  completely  different  from 
that  obtained  in  Fig.  15. 

TWO-DIMENSIONAL  RESULTS  AND  ANALYSIS:  CASE  III 

Simulation  results  for  the  asymmetric  doping  profile  are  examined  next.  This  case  represents  an  aberration 
in  the  doping  distribution.  The  ionized  dopant  distribution  is  shown  in  Fig.  23.  In  this  case  the  total  dopant  charge 
inside  the  device  is  greater  than  both  of  the  two  previous  cases.  As  a  result,  the  device  conductance  "G"  is 
correspondingly  higher,  implying  that  the  circuit  currents  can  be  expected  to  be  higher  than  previously  obtained. 
Since  the  average  dopant  densities  are  higher,  the  peak  electric  field  values  within  the  device  should  also  be  larger. 
This  is  indicative  of  a  much  stronger  impact  ionization. 

The  transient  SAS  circuit  current  is  shown  in  Figure  24.  The  values  of  the  circuit  current  are  much  higher 
than  those  for  cases  I  and  II.  For  instance,  the  maximum  currents  during  the  applied  voltage  pulse  at  about  20  ns 
and  46  ns  are  about  4  and  5  Amperes,  as  compared  to  2.2  and  1.8  Amperes  for  case  I.  Such  high  values  of  the 
current  are  associated  with  the  large  device  conductance  and  the  ability  for  strong  impact  ionization  on  the  basis  of 
larger  electric  fields. 

Results  of  the  internal  electric  field  Ex  at  18  ns  and  20  ns  are  presented  in  Figures  25  and  26  which  show 
the  propagation  on  an  internal  electric  field  wave.  The  electric  wave,  however,  does  not  move  into  the  N-region  in 
a  uniform  manner.  Instead,  its  dynamics  and  propagation  is  determined  by  the  doping  non-uniformity  along  the 
transverse  direction.  For  example,  the  region  of  highest  doping  occurs  over  a  longitudinal  strip  located  between 
meshes  7  and  9  along  the  Y-direction.  The  electric  field  at  the  P+-N  junction  is  therefore  highest  over  Y-meshes  7 
to  9,  with  the  smallest  penetration  into  the  N-region.  The  small  penetration  is  the  result  of  a  large  slope  in  the  Ex 
profile,  since  the  slope  is  proportional  to  the  ionized  dopant  density  in  accordance  with  Gauss’  law.  Similarly,  values 
of  the  electric  field  maxima  at  the  P+-N  junction,  away  from  the  strip  between  Y-meshes  7  through  9,  are  not  as 
large.  However,  the  field  extends  deeper  into  the  N-region  due  to  the  smaller  slope  in  Ex(x)  associated  with  the 
smaller  ionized  dopant  density.  On  application  of  the  voltage  pulse,  the  fields  begin  to  increase  in  magnitude  and 
spread  into  the  N-region.  However,  the  wavefront  moves  non-uniformly  along  the  transverse  direction.  Electric  field 
propagation  into  the  N-region  over  the  strip  between  Y-meshes  7  to  9,  is  the  slowest  because  of  the  high  doping. 
The  peak  value,  on  the  other  hand,  exhibits  a  fast  growth  as  seen  in  Fig.  25.  Conversely,  the  propagation  and 
spreading  of  the  electric  field  over  the  regions  outside  the  high  doping  strip  is  much  faster,  even  though  the  peak 
field  values  do  not  increase  quite  as  much.  This  behavior  is  evident  in  Fig.  25  which  shows  an  Ex  wave  moving 
faster  in  an  asymmetric  fashion.  At  20  ns,  this  non-uniform  movement  is  even  more  apparent.  A  distinct  hump  is 
seen  towards  the  left  surface  of  the  device  roughly  given  by  mesh  coordinates  X  =  80,  Y  =  2. 
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Finally,  plots  of  the  longitudinal  conduction  current  at  20  ns  are  shown  in  Figs.  27.  The  plot  indicates  that 
unlike  the  previous  cases,  the  conduction  current  is  not  largest  within  the  N+  and  the  P+  regions.  This  is  due  to  the 
relatively  negligible  electric  fields  Ex  in  these  regions  which  reduces  the  carrier  drift  velocities,  and  hence  the 
conduction  current,  to  negligible  levels.  Instead,  there  are  two  localized  areas  of  large  conduction  current.  These 
occur  within  the  N-region  near  the  P+-N  junction  at  the  two  transverse  ends,  and  are  removed  from  the  asymmetric 
highly  doped  strip.  These  areas  of  high  conduction  current  are  also  seen  to  be  moving  towards  the  N+  region.  This 
movement  is  brought  about  by  the  propagating  electric  field. 

There  is  some  similarity  between  this  situation  and  that  chosen  for  case  I.  Both  involve  convex  shaped 
transverse  doping  profiles  with  minima  at  the  two  outer  boundaries.  One  might  therefore  expect  the  conduction 
current  distributions  to  be  somewhat  similar.  However,  a  comparison  of  Figs.  27  and  15  shows  that  this  is  not  the 
case.  For  example,  while  case  I  exhibits  a  partial  conductive  filament  at  the  central  axis,  most  of  the  conduction 
current  in  Fig.  27  occurs  towards  the  two  boundaries  away  from  the  center.  This  difference  can  be  attributed  to  the 
following  features:  (i)  The  existence  of  a  much  stronger  transverse  diffusive  process  as  compared  to  case  I,  due 
to  the  sharper  gradient  in  the  doping  profile.  This  diffusion  works  to  force  mobile  carriers  away  from  the  higher 
doped  asymmetric  longitudinal  strip,  towards  the  outer  boundaries  more  efficiently,  (ii)  The  presence  of  much 
higher  values  of  the  maximum  electric  fields  Ex  throughout  the  P+-N  junction  as  compared  to  case  I.  The  doping 
for  case  III  is  the  same  or  higher  than  that  of  case  I  for  all  points.  The  higher  fields  lead  to  exponential  increases 
in  the  ionization  coefficients. 

The  two  factors  mentioned  above  work  to  produce  the  following  sequence  within  the  N-region.  As  the  turn¬ 
on  voltage  ramp  is  applied,  the  mobile  carriers  that  lie  just  outside  the  P+-N  depletion  region  begin  moving.  This 
produces  a  conduction  current  wave  and  the  free  carrier  density  begins  to  increase  within  regions  to  the  right. 
However,  the  process  of  diffusion  which  is  strong  in  this  case  (but  not  for  case  I),  forces  the  excess  carriers  away 
from  the  asymmetric  high  doped  longitudinal  strip  and  towards  the  outer  boundaries.  With  time,  the  internal  electric 
fields  increase  leading  to  larger  values  of  Emax  at  the  P+-N  junction  boundary  and  the  fields  expand  deeper  into  the 
N-region.  The  former  initiates  the  impact  ionization  process  and  promotes  electron  injection  into  the  N-region.  The 
latter  enhances  the  local  drift  velocity  values,  significantly  increasing  the  drift  velocity  and  hence  the  conduction 
current.  However,  since  the  electric  field  does  not  move  as  fast  nor  does  it  increase  as  much  over  the  asymmetric 
strip,  the  growth  in  the  drift  velocity  is  not  quite  as  large.  Besides,  strong  carrier  diffusion  forces  the  mobile  carriers 
away  from  the  strip  and  towards  the  two  boundaries. 

The  occurrence  of  a  crest  in  the  conduction  current  of  Fig.  27  signifies  the  formation  and  movement  of 
single  electronic  wavepacket.  The  production  of  only  a  single  wavepacket  or  electronic  swarm  is  influenced  by  the 
internal  polarization  effect.  As  the  electrons  get  injected  into  the  N-region  from  the  P+  -N  junction  following  impact 
ionization,  the  value  of  the  maximum  electric  field  begins  to  collapse  due  to  the  polarization.  This  reduces  the  fields 
below  the  impact  ionization  threshold,  and  prevents  the  subsequent  and  continuous  launching  of  electron  waves  into 
the  N-region.  As  a  result,  only  one  swarm  or  packet  of  electrons  is  injected  into  the  N-region.  Furthermore,  the 
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reduction  in  the  applied  bias  at  20  ns,  also  has  the  effect  of  shutting  off  the  impact  ionization  process. 

TWO-DIMENSIONAL  RESULTS  AND  ANALYSIS:  CASE  IV 

Finally,  2D  simulation  results  for  an  SAS  device  having  a  uniform  dopant  profile  along  the  transverse 
direction  were  also  obtained  for  completeness.  Fig.  28  shows  the  density  of  internal  immobile  charge,  which  has 
no  variations  along  the  transverse  direction  as  expected.  The  transient  current  of  Figure  29  shows  a  behavior 
somewhat  similar  to  that  of  Fig.  9.  The  growth  and  propagation  of  an  electric  field  wave  starting  from  the  P+-N 
junction  is  evident  in  Figs.  30  and  3 1 .  Again  there  are  no  variations  along  the  transverse  axis  due  to  the  uniformity 
of  the  dopant  profile.  This  leads  to  near-zero  transverse  electric  fields  Ey  within  the  device  at  all  times  as  shown 
in  Fig.  32.  Finally,  the  conduction  current  distributions  within  the  simulated  SAS  device  at  two  different  times 
during  the  applied  voltage  pulse  are  shown  in  Figs.  33  and  34.  From  the  plot  of  Fig.  33  it  can  be  seen  that  the 
conduction  currents  within  the  P+  and  N+  regions  are  almost  equal  and  uniform.  Since  a  high  density  of  free  carriers 
is  available  in  these  regions,  the  drift  (and  hence  the  conduction  current)  component  dominates  and  is  nearly  equal 
to  the  total  current  IT.  Requirement  of  current  continuity  and  the  conservation  of  total  current  IT  therefore  dictates 
that  the  conduction  currents  in  the  P+  and  N+  regions  nearly  equal  IT  and  hence  match  each  other.  The  current  within 
a  portion  of  the  N-region  that  lies  towards  the  N-N+  boundary  is  also  composed  mainly  of  the  drift  current.  This 
region  is  just  beyond  the  P+-N  depletion  boundary,  and  has  a  supply  of  free  electrons  available  for  conduction.  Even 
though  their  densities  are  slightly  lower  than  those  within  the  N+  and  the  P+  regions,  a  larger  drift  velocity  driven 
by  higher  electric  fields  compensates  for  the  difference.  This  occurs  because  the  electric  field  increases  more  quickly 
in  the  N-region  than  in  either  of  the  P+  or  N+  regions.  However,  the  absence  of  free  within  the  depletion  region  near 
the  P+-N  junction,  leads  to  near-zero  conduction  current  levels.  This  is  clearly  evident  in  Fig.  33.  At  a  later  time 
of  20  ns,  the  process  of  avalanching  and  electron  injection  over  the  P+-N  junction  changes  the  conduction  current 
distribution.  As  the  electric  field  at  the  P+N  junction  increases  and  crosses  the  impact  ionization  threshold  during 
the  voltage  ramping,  a  wave  of  electrons  is  injected  into  the  N-region.  This  produces  two  effects:  (i)  The  drift 
component  is  increased  within  the  N-region  lying  near  the  P+  boundary,  and  (ii)  The  rate  of  increase  in  the  electric 
field  is  lowered.  This  decrease  in  the  rate  can  also  be  viewed  as  a  polarization  effect.  Suppression  of  the  electric 
field  due  to  this  polarization  effect,  prevents  the  subsequent  launching  of  electron  waves  into  the  N-region. 
Furthermore,  turning  off  the  applied  voltage  at  20  ns,  also  precludes  a  continuous  and  steady  electron  injection.  As 
a  result,  only  one  swarm  or  packet  of  electrons  are  injected  into  the  N-region.  A  high  conduction  current  is 
associated  with  this  electron  packet  as  it  moves  in  the  N-region  towards  the  N+  side.  This  appears  in  the  snapshot 
picture  of  Fig.  34  as  the  crest  of  a  conduction  current  wave  located  near  the  P+-N  boundary. 

SUMMARIZING  CONCLUSIONS 

In  this  research,  a  two-dimensional  simulator  for  the  SAS  device  has  been  developed  based  on  the  drift- 
diffusion  model.  It  represents  the  first  attempt  to  obtain  accurate  device  performance  predictions  and  the  anticipated 
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transient  response.  The  2D  semiconductor  model  was  successfully  implemented  into  a  numerical  code.  The  role 
of  transverse  variations  in  the  doping  profile  were  then  analyzed  by  obtaining  the  transient  current  characteristics. 
Four  different  doping  profiles  were  studied  through  the  numerical  simulations.  The  dynamics  of  development, 
growth  and  propagation  of  internal  electric  field  waves  were  also  analyzed.  These  studies  provide  useful  insight  into 
the  physics  of  the  avalanche  process.  Finally,  predictions  of  potential  current  filamentation  within  the  device  for  a 
given  doping  profile  were  obtained.  Simulation  data  for  the  conduction  current  distribution  within  the  device  at 
various  time  instants,  clearly  showed  the  possibility  of  filamentary  mode  growth. 

A  number  of  important  conclusions  can  be  drawn  from  the  results  presented  in  the  previous  chapter.  The 
main  deductions  and  salient  features  are  as  listed  below. 

(1)  The  dopant  densities  are  seen  to  strongly  affect  the  circuit  currents  by  controlling  the  device  conductance. 
Both  the  magnitude  and  the  temporal  shape  can  vary  dramatically  with  changes  in  the  doping  profiles.  In 
general,  the  current  magnitudes  are  higher  for  larger  doping.  This  is  the  combined  result  of  two  factors: 
(i)  An  increased  conductance  which  reduces  the  overall  resistance  in  the  circuit,  and  (ii)  the  increased  value 
of  the  electric  field  at  the  p-n  junction  which  enhances  impact  ionization. 

(2)  Current  pulses  having  durations  in  the  sub-nanosecond  range  were  seen  to  result  upon  the  application  of 
much  broader  voltage  pulses.  However,  there  seems  to  be  a  trade-off  between  current  pulse  sharpness  and 
the  potential  for  instability  linked  to  current  filamentation.  Sharp  and  narrow  current  pulses,  are  associated 
with  the  growth  of  filamentary  currents.  The  sharp  pulses  were  the  direct  result  of  strong  impact  ionization 
and  avalanche  action.  However,  the  avalanche  process  can  be  very  non-uniform  due  to  doping  variations 
giving  rise  to  localized  surges  in  the  conduction  current.  Though  such  localized  current  may  not  be 
detrimental  over  a  short  duration,  the  device  could  be  destroyed  during  repetitive  operation  as  a  result  of 
prolonged  internal  heating.  The  absence  of  sharp  current  pulses,  on  the  other  hand,  provides  for  more  stable 
operation. 

(3)  Evidence  of  a  distinct  electric  field  wave  propagating  from  the  cathode  to  the  anode  of  the  SAS  device  was 
presented  through  the  various  simulations.  The  movement  and  propagation  of  the  electric  field  wave  was 
seen  to  be  non-uniform  in  general,  and  dependent  on  the  doping  variations  along  the  transverse  dimension. 
Larger  values  of  the  doping  created  a  slower  moving  wave  since  the  associated  depletion  regions  did  not 
increase  quite  as  rapidly. 

(4)  Associated  with  a  propagating  electric  field  wave,  was  a  travelling  conduction  current.  For  "near-uniform" 
profiles  along  the  transverse  direction,  the  conduction  current  was  seen  to  develop  over  regions  of  highest 
doping  density,  and  to  propagate  away  towards  the  anode.  Furthermore,  this  conduction  current  was  due 
to  mobile  electrons.  This  result  seems  to  suggest  that  slight  increases  in  the  dopant  density  over  the  central 
portions  of  the  SAS  device  could  potentially  be  detrimental.  The  corresponding  conduction  current  increases 
and  possible  temperature  enhancements  over  the  central  portion,  would  be  more  difficult  to  control  or 
quench.  Dopant  density  increases  near  the  outer  surfaces  or  peripheral  regions  might  present  a  less  acute 
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problem  since  the  associated  temperature  enhancements  could  be  controlled  through  external  cooling. 
Fortunately,  the  doping  profiles  resulting  from  the  diffusion  process  naturally  provide  such  doping 
enhancements  over  the  peripheral  regions. 

(5)  Carrier  diffusion  was  seen  to  be  important  for  profiles  having  strong  gradients  in  the  doping  profiles.  For 
instance,  results  of  the  previous  section  showed  that  localized  areas  of  high  conduction  currents  could  arise 
in  regions  with  moderate  doping.  This  was  caused  by  rapid  transverse  diffusion.  This  suggests  that  there 
is  an  optimal  gradient  of  the  doping  along  the  transverse  direction  which  would  lead  to  a  more  uniform 
current  flow.  Since  "near-uniform"  profiles  cause  filaments  over  regions  of  highest  doping,  while  sharp 
gradients  give  rise  to  filaments  over  regions  of  relatively  low  doping,  a  moderate  gradient  would  lead  to 
a  more  even  current  distribution. 

(6)  The  impact  ionization  and  avalanching  processes  within  the  SAS  device  are  caused  by  holes  rather  than 
electrons.  During  the  turn-on  transient,  holes  move  from  the  N-region  towards  the  P+  side  of  the  junction 
in  response  to  the  voltage  pulse.  The  reason  that  holes  dominate  the  carrier  generation  process  as  compared 
to  electrons,  is  associated  with  the  doping  densities  on  either  side  of  the  P+-N  junction.  The  minority 
electron  densities  on  the  P+  side  are  relatively  much  smaller  than  the  minority  hole  densities  on  the  N-side. 
Consequently,  on  applying  the  voltage  ramp,  more  minority  holes  are  able  to  traverse  the  high  field  region 
at  the  P+-N  junction  boundary  from  the  N-side,  than  minority  electrons  from  the  P+-side.  As  a  result,  carrier 
generation  is  facilitated  mainly  by  the  overwhelming  population  of  moving  minority  holes. 

(7)  Finally,  the  SAS  devices  were  seen  to  require  at  least  25  ns  to  make  a  complete  recovery  to  their  original 
state  following  a  voltage  pulse.  This  time  is  expected  to  increase  with  the  transverse  dimensions  of  the 
device.  Such  increments  in  the  transverse  device  dimensions  may  be  necessary  for  enhancing  the  current 
carrying  capability.  However,  by  increasing  the  transverse  dimensions,  the  recovery  time  would  be 
adversely  affected. 

(8)  As  regards  the  doping  of  the  N+  and  P+  contact  regions,  the  simulation  data  suggests  that  increases  in  the 
concentration  would  lead  to  enhancements  in  the  circuit  currents.  The  process  of  impact  ionization  would 
also  increase  due  to  the  higher  values  of  the  electric  fields  at  the  junction.  Hence,  a  high  doping  density 
of  about  1018  cm'3  at  the  P+  and  N+  end  regions,  appears  to  be  a  desirable  goal.  These  levels  can  realistically 
be  attained  through  high  temperature  diffusion. 

SCOPE  FOR  FUTURE  RESEARCH  WORK 

The  research  presented  here  focussed  on  developing  numerical  simulation  codes  for  the  two-dimensional 
modeling  of  the  SAS  switches.  This  technology  has  shown  promise  over  other  optical  based  switching  approaches. 
The  simulations  developed  here  were  meant  to  analyze  a  simple  device  geometry  and  to  provide  insight  into  the 
physics  of  device  operation.  The  model  was  thus  a  first  step  towards  the  development  of  a  more  powerful  two- 
dimensional  device  simulator.  However,  many  other  complicated  features  could  be  incorporated  as  additions  to 
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enhance  the  utility  and  applicability  of  the  simulator.  Including  the  enhancements  would  constitute  the  next  set  of 

research  tasks.  A  summary  of  the  various  tasks  and  the  scope  of  future  research  are  listed  below. 

(i)  Development  of  an  enhanced  two-dimensional  model  to  include  non-uniform  device  cross-sections.  The 
non-uniform  cross  sections  may  be  due  to  the  planar  processing  techniques  used  during  the  fabrication 
process.  In  such  a  case,  the  two  metallic  contacts  to  the  device  would  typically  be  placed  on  the  same  front 
surface.  The  cross-sections  could  also  be  non-uniform  to  suppress  surface  flash-over  and  dielectric 
breakdown.  In  order  to  avoid  surface  flash-over,  for  instance,  the  path  length  between  the  anode  and 
cathode  has  to  be  increased  as  much  as  possible.  Tapered  trapezoidal  cross-sections  have  been  proposed 
in  this  regard  to  reduce  the  magnitude  and  the  location  of  the  maximum  internal  electric  field  distribution 
within  the  device. 

(ii)  Inclusion  of  more  complex  external  circuits.  Elements  such  as  inductors  need  to  be  included  for  more 
realistic  simulations.  Such  inductive  elements  could  either  be  part  of  an  actual  pulse  shaping  circuit  or  may 
be  associated  with  the  self-  and  mutual-inductances  of  the  connecting  wires. 

(iii)  Allowing  for  more  complicated  variations  in  the  doping  profiles  along  the  transverse  direction.  Such 
variations  could  include  localized  clumps  and  clusters,  point  defects  and  other  profiles  lacking  translational 
symmetry  along  both  the  longitudinal  and  transverse  directions.  This  would  also  reveal  the  potential  for 
current  filamentation  due  to  the  presence  of  localized  traps. 

(iv)  The  numerical  simulations  could  also  be  performed  for  a  class  of  other  voltage  waveforms.  In  the  present 
study  only  one  waveform  was  used  throughout.  Furthermore,  the  magnitudes  of  the  peak  turn-on  voltage, 
its  temporal  shape,  and  turn-on  slope  could  be  varied  for  a  more  extensive  evaluation. 

(v)  In  this  study,  only  P+-N-N+  structures  were  examined.  As  an  extension,  other  possibilities  such  as  the  P+-P- 
N+  or  the  P+-I-N+  structures  could  be  analyzed  as  well. 

(vi)  This  work  focussed  on  the  transient  electrical  response  of  a  single  SAS  device.  However,  for  applications 
requiring  voltage  scaling,  a  number  of  such  SAS  devices  may  have  to  be  used  in  a  serially  stacked  mode. 
Evaluating  the  performance  of  such  a  composite  stacked  arrangement  could  be  another  direction  for 
extending  the  current  simulations. 

(vii)  A  final  area  of  future  research  in  connection  with  the  SAS  shaper  devices  might  be  to  extend  the  analysis 
to  heterojunction  structures.  Instead  of  having  an  all-Silicon  device,  the  potential  benefits  of  a  Si-Ge 
heterojunction  structure  could  be  examined. 
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THEORY  OF  WAVE  PROPAGATION  IN  A  TIME  -  VARYING  MAGNETOPLASMA 
MEDIUM  AND  APPLICATIONS  TO  GEOPHYSICAL  PHENOMENA 


Dikshitulu  K.  Kalluri 
Professor 

Department  of  Electrical  Engineering 
University  of  Massachusetts  Lowell 

Abstract 

Using  one  dimensional  theory,  several  aspects  of  electromagnetic  wave  transformation  in  a 

dynamic  magnetized  plasma  are  considered.  They  may  be  grouped  under  the  following 

categories 

1 .  Effects  of  finite  rise  time  and  range  length  on  the  reflection  coefficients  of  a  dynamic 
inhomogeneous  plasma  medium. 

2.  Alteration  of  pulse  duration  in  a  dynamic  magnetized  plasma  medium 

3.  Green's  function  and  perturbation  technique  for  a  switched  time  -  varying  magnetoplasma 
medium 

4.  Application  of  the  theoretical  results  developed  in  1  -  3  to  the  investigation  of  lightning 
induced  ionospheric  effects  and  other  geophysical  phenomena  connected  with  dynamic 
magnetized  plasma  medium.  This  effort  will  continue  in  active  collaboration  with  my  focal 
point . 
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THEORY  OF  WAVE  PROPAGATION  IN  A  TIME  -  VARYING  MAGNETOPLASMA 
MEDIUM  AND  APPLICATIONS  TO  GEOPHYSICAL  PHENOMENA 

Dikshitulu  K.  Kalluri 

1.  General  Introduction 

A  brief  review  and  references  to  the  previous  work  on  the  theory  of  wave  propagation  in  a  time  - 
varying  magnetoplasma  medium  are  given  in  this  section.  A  comprehensive  account  may  be 
found  elsewhere  [1]. 

1.1  Switched  Isotropic  Plasma  Medium:  A  and  B  waves 

Jiang  [2]  investigated  the  effect  of  switching  suddenly  an  unbounded  isotropic  cold  plasma 
medium.  Since  the  plasma  was  assumed  to  be  unbounded,  the  problem  was  modeled  as  a  pure 
initial  value  problem.  The  wave  propagation  in  such  a  medium  is  governed  by  the  conservation  of 
the  wave  number  across  the  time  discontinuity  and  consequent  change  in  the  wave  frequency. 
Rapid  creation  of  the  medium  may  be  approximated  as  a  sudden  switching  of  the  medium.  The 
main  effect  of  switching  the  medium  is  the  splitting  of  the  original  wave  (  henceforth  called  incident 
wave,  in  the  sense  of  incidence  on  a  time  discontinuity  in  the  properties  of  the  medium  )  into  new 
waves  whose  frequencies  are  different  from  the  incident  wave.  Recent  experimental  work  [3]  -  [5] 
and  computer  simulation  [6]  demonstrated  the  frequency-upshifting  of  microwave  radiation  by 
rapid  plasma  creation. 

In  any  practical  situation,  the  plasma  is  bounded  and  the  problem  can  no  longer  be  modeled  as  a 
pure  initial  value  problem.  Kalluri  [7]  dealt  with  this  aspect  by  considering  the  reflection  of  a 
traveling  wave  when  an  isotropic  cold  plasma  is  switched  on  only  over  z  >  0  half-space.  Kalluri  has 
shown  that  the  reflected  field  in  free  space  is  comprised  of  two  components,  A  and  B.  The  A 
component  is  due  to  reflection  at  the  spatial  discontinuity  at  z  =  0  formed  at  t  =0.  Its  frequency  is 
the  same  as  that  of  the  incident  wave.  The  temporal  discontinuity  gives  rise  to  two  additional 
waves  in  the  plasma.  These  are  called  B  waves.  One  of  them  propagates  in  the  negative  z- 
direction,  and  it  is  this  wave  which  undergoes  partial  transmission  into  free  space.  The  frequency 
of  the  B  component  is  different  from  that  of  the  incident  wave. 
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Kalluri  et  al.  [8]  recently  brought  the  solved  problem  closer  to  the  practical  situation  by  considering 
the  switched  plasma  medium  as  a  slab  of  width  d  (  Figure  1).  A  qualitative  picture  of  the  reflected 
and  transmitted  waves  is  given  in  the  figure.  If  the  incident  wave  frequency  coo  is  less  than  the 
plasma  wave  frequency  cop,  i.e.  coo  is  in  the  stop  band  of  the  medium,  the  transmitted  wave  will  be 
a  B  wave  of  the  new  [2]  frequency  co-j  =  ( coo2  +  cop2  )1/2  . 

The  solution  of  the  slab  problem  was  complex  but  it  would  be  more  complex  if  the  posed  problem 
was  to  be  brought  even  closer  to  the  practical  situation  i.e.,  the  problem  of  switching  a  space- 
varying  and  time-varying  plasma  of  finite  extent.  However,  since  the  B  waves  are  generated  in  the 
plasma  their  basic  nature  in  terms  of  the  frequency  shift,  power  and  energy  in  them  and  how  they 
are  damped  can  all  be  studied  qualitatively  by  considering  the  switching  of  the  unbounded 
medium. 


1.2  Switched  Unbounded  Magnetoplasma  Medium:  L,  R,  O  and  X  Waves 

It  is  known  [9]  that  the  presence  of  a  static  magnetic  field  will  influence  the  frequency  shift  etc. 
Rapid  creation  of  a  plasma  medium  in  the  presence  of  a  static  magnetic  field  may  be  approximated 
as  the  sudden  switching  of  a  magnetoplasma  medium.  An  idealized  mathematical  description  of 
the  problem  is  given  next.  A  uniform  plane  wave  is  traveling  in  free  space  along  z  direction  and  it  is 
assumed  that  at  t  =  0-,  the  wave  occupies  the  entire  space.  A  static  magnetic  field  of  strength  Bo 
is  assumed  to  be  present  throughout.  At  t  =  0  the  entire  space  is  suddenly  ionized  thus 
converting  the  medium  from  free  space  to  a  magnetoplasma  medium.  The  incident  wave 
frequency  is  assumed  to  be  high  enough  that  the  effects  of  ion-motion  may  be  neglected. 

The  strength  and  the  direction  of  the  static  magnetic  field  effect  the  number  of  new  waves 
created,  their  frequencies,  and  the  power  density  of  these  waves.  Some  of  the  new  waves  are 
transmitted  waves  ( waves  propagating  in  the  same  direction  as  the  incident  wave  )  and  some  are 
reflected  waves  (  waves  propagating  in  the  opposite  direction  to  that  of  the  incident  wave  ). 
Reflected  waves  tend  to  have  less  power  density. 

A  physical  interpretation  of  the  waves  may  be  given  in  the  following  way.  The  electric  and 
magnetic  fields  of  the  incident  wave  and  the  static  magnetic  field  accelerate  the  electrons  in  the 
newly  created  magnetoplasma,  which  in  turn  radiate  new  waves.  The  frequencies  of  the  new 
waves  and  their  fields  can  be  obtained  by  adding  contributions  from  the  many  electrons  whose 
positions  and  motions  are  correlated  by  the  collective  effects  supported  by  the  magnetoplasma 


6-4 


6-5 


medium.  Such  a  detailed  calculation  of  the  radiated  fields  seems  to  be  quite  involved.  A  simple, 
but  less  accurate,  description  of  the  plasma  effect  is  obtained  by  modeling  the  magnetoplasma  as 
a  dielectric  medium  whose  refractive  index  is  computed  through  magnetoionic  theory  [10].  The 
frequency  of  the  new  waves  are  constrained  by  the  requirements  that  the  wave  number  (ko)  is 
conserved  over  the  time  discontinuity  and  the  refractive  index  n  is  that  applicable  to  the  type  of 
wave  propagation  in  the  magnetoplasma.  This  gives  a  conservation  [1 1]  law  ko  c  =  coo  = 

n  (co)  co  from  which  co  may  be  determined.  Solution  of  the  associated  electromagnetic  initial  value 
problem  gives  the  electric  and  magnetic  fields  of  the  new  waves.  Using  this  approach  the  general 
aspects  of  wave  propagation  in  a  switched  magnetoplasma  medium  were  discussed  earlier 
[9],[12],[13]  for  the  cases  of  L,  R  and  X  incidence  [10]. 

L  incidence  is  an  abbreviation  used  to  describe  the  case  where  the  incident  wave  has  left  -  hand 
circular  polarization  and  the  static  magnetic  field  is  in  the  z  direction  (  Fig.  2  (a)).  Three  new  waves 
with  left  -  hand  circular  polarization  labeled  as  LI,  L2  and  L3  are  generated  by  the  medium 
switching.  LI  is  a  transmitted  wave.  R  incidence  refers  to  the  case  where  the  incident  wave  has 
right  -  hand  circular  polarization  and  the  static  magnetic  field  is  in  the  z  direction  .  The  medium 
switching  ,  in  this  case,  creates  three  R  waves  labeled  as  R1,  R2  and  R3.  R1  and  R3  are 
transmitted  waves.  The  frequencies  of  the  R  waves  are  the  roots  of  the  cubic  characteristic 
equation  [9]: 

“n  -®b“'-(“o  +  “p>“n+  “o“b=  0  (121) 

In  the  above  ©b  is  the  electron  gyrofrequency  and  the  incident  wave  variables  are  denoted  by 
zero  subscript.  A  negative  value  for  ©n  [9]  indicates  that  the  wave  is  a  reflected  wave  i.e.  ,  it 
propagates  in  a  direction  opposite  to  that  of  the  source  ( incident )  wave.  If  the  source  wave  is 
right-going,  then  the  reflected  wave  will  be  left-going. 


X  incidence  refers  to  the  case  where  the  electric  field  of  the  incident  wave  is  in  the  x  direction  and 
the  static  magnetic  field  is  in  the  y  direction.  The  medium  switching  in  this  case  creates  four 
extraordinary  waves  labeled  as  XI,  X2,  X3  and  X4.  XI  and  X2  are  transmitted  waves.  The 
frequencies  of  these  waves  are  : 


+  2<Op)  +  (<Dp+  °>0<<»b+<»p))  =0 


( 1.2.2) 
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Right-Hand  Circular 
Polarization 


t<0 


(b) 


FIG.  2  L  and  R  Waves 


1.3  Some  Remarkable  Effects 


The  main  effect  of  suddenly  switching  a  magnetized  medium  is  the  splitting  of  the  source  wave 
into  new  waves  whose  frequencies  are  either  upshifted  or  downshifted.  A  slow  switching  of  the 
medium  modifies  the  frequency  and  amplitude  [14],  [15]  of  the  source  wave.  For  certain  choice  of 
the  parameter  change  of  the  switched  medium,  some  remarkable  effects  like  frequency  shifting 
with  power  intensification  [16],  [17]  and  conversion  of  the  whistler  wave  into  a  wiggler  magnetic 
field  occur. 

This  report  contains  a  brief  discussion  of  the  new  results  obtained  during  the  execution  of  the 
SREP  95  proposal.  Several  of  these  results  are  being  prepared  for  publication  in  refereed 
journals. 

2.  Effect  of  Variation  of  Electron  density  in  Space  and  Time 
2.1  Introduction 


A  sudden  creation  of  plasma  half  space  considered  in  [7]  is  a  mathematical  idealization  of  a  rapid 
creation  of  a  plasma  medium  with  a  sharp  boundary.  Here  the  square  of  the  plasma  frequency 
top2(z,  t)  is  given  by 


“p(z.t)  =  O)p0u(z  )u(t) 


(2.1.1) 

where  u(.)  is  a  step  function.  The  steady  state  reflected  field  in  free  space  is  comprised  of  A  and  B 
waves  (  section  1)  and  the  reflection  coefficients  Ra  and  Rb  are  given  in  [7], 


A  more  realistic  dynamic  plasma  medium  problem  should  model  the  cop2  (z,  t)  profile  by 

“p  ( 2- 1  >  =“p,+(“p2-“p,)'i(z  >f2(t>  (212) 

where  copi  is  the  plasma  frequency  of  the  medium  1  as  z  ->  -  oo  and  t  ->  -  <»  and  cop2  is  the 
plasma  frequency  of  the  medium  2  as  z->  °°  and  t  ->  and  fi  (z)  and  f2(t)  functions  describe  the 
transition  of  the  plasma  frequency  from  its  asymptotic  values.  Figures  3  (a)  and  3  (b)  show  a  non 
periodic  change  of  the  plasma  frequency  from  medium  1  to  medium  2  over  a  length  scale  (range 
length)  L  and  a  time  scale  Tr  (  rise  time).  Let  the  source  monochromatic  plane  wave  with 
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(b) 


♦  reference  step  profile 
I  arbitrary  profile 


FIG.  3  (a)  Spatial  profile  (b)  Temporal  profile 
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frequency  coi  be  propagating  in  z  direction  in  medium  1  of  plasma  frequency  copi  .The  wave 
number  q-j  i  of  the  source  wave  is  given  by: 


=  “l  ^0ei(co1)=lV  ©*- 


00 


pi 


(2.1.3) 


The  Instantaneous  frequencies  (  ±  co(t ) )  of  the  transmitted  and  reflected  B  waves  satisfy  the 
equation 


(2.1.4) 


and  the  asymptotic  values  of  these  frequencies  ( ±  C02  )are  given  by 


co2 


(2.1.5) 


A  profile  approximation  of  (2.1 .2)  valid  for  many  practical  problems  is  to  restrict  (2.1 .2)  to  a  finite 
rise  time  Tr  and  a  finite  range  length  L.  The  limiting  case  of  L  approaching  zero  and  Tr 
approaching  zero  will  give  the  step  profile  function: 

“p<z'‘>  =“pi+<“p2-“p,>u< z  >u<t)  (216) 


The  steady-state  solution  for  this  profile  is  easily  written  by  generalizing  the  results  given  in  [7]  and 
are  discussed  in  the  subsection  2.2.  The  effects  of  non  zero  values  for  L  and  Tr  on  the  reflection 
coefficients  Ra  and  Rb  of  A  and  B  waves  are  discussed  in  the  subsections  2.3  and  2.4.  An 
approximate  analysis  of  the  problem  may  be  done  by  noting  that  the  range  length  L  primarily 
affects  Ra  and  the  rise  time  Tr  primarily  effects  Rb.  The  problem  may  thus  be  decomposed  into 
two  simpler  problems.  The  first  problem  assumes  a  time  -  invariant  space  -varying  profile  with 
range  length  L.  The  second  problem  assumes  a  space  -  invariant  time  -varying  profile  with  rise  time 
Tr.  Solution  of  each  of  the  problems  involves  the  following  steps. 


Stepl  Obtain  the  scattering  coefficients  R  and  T  for  the  reference  step  profile. 
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Step  2  Obtain  the  Green's  function  for  the  step  profile 

Step  3  Obtain  the  equation  for  the  correction  terms  AR  and  AT  in  terms  of  the  Green's 
function  and  the  solution  for  the  reference  profile.  Obtain  a  perturbation  series 
solution  valid  for  arbitrary  profiles. 

Step  4  Obtain  a  simple  approximation  for  AR  and  AT  based  on  the  dominant  term  in  the 
series  valid  for  a  small  value  for  the  range  length  L  or  the  rise  time  Tr. 

The  details  of  these  derivations  [18]  -  [19]  are  omitted  from  this  report  and  the  results  are  given  in 

the  subsections  2.2  -  2.4. 

2.2  Results  for  the  Step  profile 


The  normalized  electric  field  amplitude  eBN  /Eo  ( normalized  with  respect  to  the  incident  wave 
amplitude)  of  the  negative  B  wave  of  frequency  (  -  W2)  and  wave  number  qn  >  created  in  the 
switched  medium  is  given  by 

-C  _  eBN  _  CO2-CO1 
■E-BN  -  -5 - - 

Eq  2co2  (2.2.4) 


The  reflection  coefficient  Rb  is  given  by  multiplying  (2.2.4)  with  the  transmission  coefficient 
TA2i: 


T 


A21 


qi2+ 
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(2.2.5) 


(2.2.6) 


_  co2  -  co1  2qn 

®  2co0  n  +  a 

2  H12  M11 


In  the  above  qi2  is  the  wave  number  of  the  B  wave  of  frequency  ©2  in  medium  1 : 

9-12  =  ®2  */|I0  £1(0)2)  =  -  0)^  =  -Vcof  +  C0p2  -  2©^ 

(2.2.7) 

2.3  Correction  due  to  finite  rise  time 

The  correction  to  Ebn  due  to  an  arbitrary  electron  density  profile  of  small  rise  time  Tr  is  given  by 
AEbn  =  T?dx  A  <4  (T)  —  izlse 

rl  J  (2.3.1) 

where 


EBp  = 


©!  +  co2 
2co2 


A  C0p  (t)  =  ©J  ( t )  —  c5 J  (t) 


( 2.3.3 ) 


(2.3.2) 


and  cop2  is  the  reference  step  profile.  If  we  choose  the  origin  so  that  the  area  under  the  A  cop2 
curve  is  zero,  (  2.3.1)  simplifies  to 


AE 


BN 


__  CO  ^  +  CO2 

2  co2 


*Tr2 

-Tr, 


d  x  A  C0p  ( x )  x 


(2.3.4) 


If,  furthermore,  the  profile  is  linear 


A  EBn 


co2  +  COi 

2  co2 


2  4 


and 


(2.3.5) 
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A  RB  -  T A2i  A  Ebn 


( 2.3.6 ) 


2.4  Correction  due  to  finite  range  length  L 


The  correction  to  Ra  due  to  an  arbitrary  profile  of  small  range  length  L  is  given  by 


AR4  = 


j(q„+q2,) 


^ — r  G2dC  A  q2  (O  _ 
+  q„.  )2  1  > 


4(^11C121  fL2 


(q„  +  q2i) 


J_L2  dC  C  A  q2  (C  ) 


(2.4.1) 


where 


A  q2(z)  =  q2(z)-q2(z) 


(  2.4.2 ) 


and  q2  ( z )  is  the  reference  step  profile.  If  we  choose  the  origin  so  that  the  area  under  the  q2curve 
is  zero,  (  2.4.1)  simplifies  to 


ARa  = 


11^21 


(Qii  +  q2i^ 


}^2 dC C A  q2  (C) 


(2.4.3) 


If,  furthermore,  the  q2  ( z )  profile  is  linear  ( see  Figure  4 ),  we  get 


-  4  On  0.21  Ra  24" 


(2.4.4) 


ra ( 1  - “gy'  )  (i- 


%  >*  wf 


(2.4.5) 
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FIG.  4  Linear  Profile 
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3.  Pulses  and  Wave  Packets  In  Switched  Anisotropic  Plasma  Medium 

3.1  Introduction 

In  section  1,  the  effect  of  switching  a  magnetopasma  medium  on  a  monochromatic  plane  wave  is 
reviewed.  In  this  section  we  consider  the  effect  of  switching  the  medium  on  a  monochromatic 
pulse.  Let  coo  be  the  frequency,  k  the  wave  number,  Lo  the  length,  To  the  duration  of  the 
monochromatic  pulse  traveling  in  z  direction  in  free  space  (  Figure  5  a ).  At  t  =  0  let  the  medium  be 
converted  to  a  magnetoplasma  whose  parameters  are  cob  and  cop.  The  effects  of  these 
parameters  on  the  frequencies  and  amplitudes  of  the  new  B  pulses  will  be  the  same  as  those  on 
the  monochromatic  plane  wave  and  are  discussed  in  [1],  [9],  [12],  At  a  temporal  discontinuity  the 
pulse  length  Lo  is  conserved.  However  the  pulse  duration  is  altered.  A  geometrical  interpretation 
of  the  change  in  the  pulse  duration  for  O,  R  and  X  pulses  is  given  in  the  Figures  5,  6  and  7 
respectively. 

3.2  Pulse  Duration 


The  pulse  duration  and  the  pulse  length  for  the  n  th  wave  are  related  by 


(3.2.1) 


where  vgr  n  is  the  group  velocity  of  the  n  th  wave.  For  the  source  pulse  in  free  space,  the  group 
velocity  is  c  and 


(3.2.2) 


From  the  dispersion  relation,  the  group  velocity  of  the  n  th  wave  (  d  con/  dk  )  may  be  obtained. 
The  pulse  duration  ratio  is  given  by 


An  increase  (decrease )  in  the  group  velocity  produces  a  pulse  compression  expansion). 
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(3.2.3) 


Monochromatic  Pulse 


FIG.  5  (a)  Monochromatic  source  pulse  in  free  space  (b)  Duration  of  O  Pulses  in 
Plasma 
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FIG.  6  Duration  of  R  Pulses 
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FIG.  7  Duration  of  X  Pulses 


6-18 


For  O  waves  in  an  isotropic  plasma  medium  the  dispersion  relation  is 


2  2  2 
or  =  co  +  or 
nop 


(3.2.4) 


*^o1,o2  _  0)0 


(3.2.5) 


For  R  waves  the  dispersion  relation  is  given  by  [20] 


>3  “  COuCO^  -  (  CO^  +  CO^  )  CO  4-  C0^C0u=  0 
n  onvo  p7n  o  & 


2to0(con-cOo) 

3c0n  -2c0ncob-c°o®p 


For  X  waves  the  dispersion  relation  is  given  by 


co^  -  co^  ( co*:  +  cof;  +  2  co!: )  +  ( co*  +  co^  ( cof;  +  co^ ) )  =  0 

n  n'o  b  P  P  O  b  p 


(3.2.6) 


(3.2.7) 


(3.2.8) 


dcon 


2  2  2 

co~  —  co  -  co: 

n  b _ p 


n  2co':-(  co^  +  co^  +  2coj:) 
n'o  b  P 7 


(3.2.9) 


3.3  Numerical  Results  and  Discussion 


The  top  portions  of  Figures  8-12  show  the  variation  of  frequency  of  the  B  pulses  as  a  function  of 
the  frequency  cog  of  the  source  pulse  in  free  space.  The  frequencies  are  normalized  by  choosing 
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1.5 


2 


2.5 


FIG.  11  XI  pulse:  (  a )  pulse  frequency  vs.  source  frequency 
( b )  pulse  duration  vs.  source  frequency 
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the  plasma  frequency  parameter  cop=  1 .0.  The  gyrofrequency  parameter  ©b  is  varied  from  0  to  1 .5 
in  steps  of  0.5.  The  slope  of  the  pulse  frequency  vs.  source  frequency  curve  is  proportional  to 
the  group  velocity  and  the  pulse  duration  is  inversely  proportional  to  the  magnitude  of  this  slope. 
The  bottom  portions  of  these  curves  describe  the  variation  of  the  pulse  duration  of  the  various  B 
pulses.  Pulse  duration  is  normalized  by  choosing  To  =1.  The  R1  and  XI  curves  for  ©b  =  0 
describe  the  01  pulse.  The  following  interesting  points  may  be  noted: 

1.  Pulse  duration  ratio  for  R1  and  XI  pulses  asymptotically  reaches  unit  value  as  ©o  tends  to 
infinity.  For  large  values  of  ©o  the  plasma  has  very  little  effect. 

2.  R2  is  a  reflected  pulse  and  its  graph  for  the  pulse  duration  is  qualitatively  similar  to  the  R1 
graph.  02,  X3,  X4  pulses  are  also  reflected  pulses  and  their  graphs  are  the  same  as  those  for  01 , 
XI ,  X2  pulses  respectively. 

3.  R3  and  X2  pulses  are  generated  only  in  a  switched  anisotropic  plasma  medium  and  their 
behavior  is  strongly  influenced  by  the  ©b  parameter.  Their  pulse  duration  goes  through  a 
minimum  and  the  minimum  point  is  strongly  influenced  by  the  ©b  parameter. 


It  is  suggested  that  the  principle  of  the  change  in  the  pulse  duration  may  be  used  to  diagnose 
dynamically  the  time  -  varying  parameters  of  an  anisotropic  medium. 

The  extension  of  the  results,  to  the  case  where  the  source  pulse  is  in  another  anisotropic 
medium  rather  than  in  free  space,  is  straight  forward.  Replace  ©o  by  ©-|  where  ©-)  satisfies  the 
dispersion  relation  (  4.3.2  )  for  the  first  anisotropic  medium.  The  pulse  duration  Ti  of  the  source 
wave  is  obtained  by  using  vgr  i  in  (  3.2.1  ).  The  pulse  duration  for  the  B  pulses  in  the  second 
anisotropic  medium  may  be  obtained  by  using  d  ©2n  /  d©i  in  (  3.2.3  )  where  ©2n  satisfies  the 
dispersion  relation  ( 4.3.4  )  in  medium  2. 

4.  Green's  Function  and  Perturbation  Technique  for  a  Switched  Anisotropic 

Medium. 

4.1  Introduction 

The  steps  involved  in  obtaining  a  solution  for  profiles  of  finite  rise  time  are  outlined  in  subsection 
2.1.  However,  in  the  anisotropic  case,  the  problem  is  more  difficult  to  solve  since  the  differential 
equation  for  the  fields  are  of  higher  order.  The  technique  is  illustrated  by  solving  the  R  wave 
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problem.  The  formulation  assumes  that  medium  1  is  also  anisotropic.  However  the  illustrative 
example  assumes  that  the  medium  1  is  free  space  (  ©pi  =  0  ). 

4.2  Solution  for  a  step  profile:  R  Wave 

Let  the  source  R  wave  of  frequency  co-j  be  traveling  in  anisotropic  medium  1  whose  parameters 
are  copi  and  cob-  At  t  =  0  the  plasma  frequency  is  changed  to  cop2- 

The  differential  equation  for  the  h  field  for  t  >0  is  given  by  ( 4.2.1  )  in  which  we  use  the  symbol  ho 
as  the  solution  for  the  reference  profile  afp2  ( t ) : 


d3h( 

dt3 


-J°V 


d2h. 


dt‘ 


k2c2 


+  co 


p2. 


dh 


I  _ik2  rZ 


jk^c2cobh0  =  0,  t  >  0  (4.2.1) 


where 


k2  c2  =  n2  co2  (4.2.2a) 


co 


1  - 


pi 


co1  ( co1  “  cob ) 


(4.2.2b  ) 


Equation  (4.2.1  )  subject  to  the  initial  conditions 

M°)=Ho  (4.2.3a) 

fi0(°)  =  J®iHo=  J  jq-  Ho  (4.2.3b) 

h0(0)  =  -  co2  H0  =  -  i^L  H0  (4.2.3c) 

n? 

may  be  solved  by  the  Laplace  transform  method  and  the  result  is 

h0(t)  ( s  +  j oa1 )  ( s  -  jcob)+[co22  +  (n?  -  1  )co2] 

pj  L  - — — — - 

0  s3-jcobs2+  (  n?  co2+  co22)s- jn?  co2cob 
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4.3  Green's  function  for  the  Step  Profile 

Green's  function  is  the  solution  of  ( 4.3.1  ): 

-  j  cob^-^-  +  cof  +  53^  ( t )  1  -  j  n2  co^  cobG  =  8  ( t  -  x )  ( 4.3.1  ) 

dt3  dt2  L  p  J  dt  ii 

The  appropriate  Green's  function,  in  the  domain  -  °o  <  t  <  0  is  built  up  of  functions  exp  ( j  coi  m  t ) 
where  coirn  are  the  roots  of  the  polynomial  equation: 

F1  ( co )  =  co3  -  cob  co2  -  ( n2  co2  +  coj^  )  co  +  n2  co2  cob=  0  (4.3.2 ) 

One  of  the  roots  of  the  cubic  (4.3.2 )  is  coi ,  say 
co1 1  =  co1  (  4.3.3  ) 
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In  the  region  0  <  t  <  the  Green's  function  is  built  up  from  function  exp  ( j  co2nt )  where  co2n  are 
the  roots  of 


F2  (co)  =  co3  -  cob co2  -  ( co^  +  C0p2 )  co  +  cob  =  0  (  4.3.4  ) 


From  causality 

G  (t,t)  =  0,  t  <  t  (4.3.5) 

For  t  >  x  the  solution  may  be  written  by  considering  separately  the  following  regions  of  ( t ,  x ) 
space: 


Case  1  t  >  x,  t  <  0,  x  <  0 

3 

G(t,x)=  £  HAmexp  [  j  co1  m  ( t  —  x )  ]  (4.3.6) 

m  =  1 

HAm  ©ay  be  determined  from  the  requirements  of  the  continuities  of  G  and  the  first  derivative  of 
•  •  * 

G  =  G,  and  a  unit  discontinuity  of  the  second  derivative  of  G  =  G 

at  t  =  x: 


G(x+,x)  =  G(x-,x  )  =  0  (4.3.7a) 

3 

S  HAm=0  (4.3.7b) 

m  =1 

G  (x+,x)  =  di  (x-  x)  =  0  (4.3.8a) 

3 

S  i«1mHAm  =  0  (4.3.8b) 

m  =1 


G  (x+, x)-G  (x“, x)  =  1  (4.3.9a) 

3 


m  =1 


H  =1 

Am 


(4.3.9b  ) 
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Note  that  the  second  term  on  the  left  side  of  (  4.3.9a )  is  zero.  From  ( 4.3.7b ),  (  4.3.8b  )  and  ( 
4.3.9b  ), 


HAm  - - 1 -  (4-3-10> 

3 

n  (®1m“fi»1p) 

p  =  1 ,  p*  m 

Case  2t  >  x,  t  >0,  t  <0 

The  impulse  source  in  medium  1  generates  the  three  waves  given  by  (  4.3.10  ).  These  waves 
travel  and  encounter  the  temporal  discontinuity  of  the  medium  at  t  =  0.  Each  of  these  waves 
generates  three  waves  on  encountering  the  temporal  discontinuity  of  the  medium  at  t  =  0  ,  the 
scattering  coefficients  for  unit  excitation  being  given  by  ( 4.2.8  ).  Since  the  frequency  of  the  m  th 
wave  is  oo-j  m  and  the  excitation  strength  of  the  m  th  wave  is  HAm  exp  ( -  j  ©i  m  t ).  the  Green's 
function  may  be  written  as 

3  3 

G(t,T)=  X  HAm  exP  ("  j“lmX)  X  S2n1m  eXP  [  j  w2  n  1  m  X]  (4-3.11) 

m  =  1  n  =  1 


In  the  above,  co2n1m  is  the  n  th  root  of  (  4.3.4  )  when  coi  is  replaced  by  coim-  H2n1m  is  the 
scattering  coefficient  of  the  n  th  wave  of  frequency  C02n1m  excited  in  medium  2  by  the  m  th  wave 
of  frequency  coi  m  in  medium  1.  If  co2n1m  is  positive  the  scattering  coefficient  is  a  transmission 
coefficient  and  if  (02nl  m  is  negative  the  scattering  coefficient  is  a  reflection  coefficient.  It  may  be 
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noted  that  as  n  varies  from  1  to  3  and  m  varies  from  1  to  3,  (02n1m  will  have  9  values.  In  actuality 
there  are  only  three  distinct  values  for  a  given  k  as  can  be  seen  from  ( 4.3.4 )  and  ( 4.3.12c ): 

k2c2  =  n2ocr  =  n2  or  (4.3.12c) 

11  1  m  1  m  ' 

The  other  6  values  coincide  with  one  of  the  three  distinct  values. 

Case  3  t  >x,t  >  0  ,  x  >0 

In  this  case,  the  impulse  occurs  in  medium  2.  The  generated  waves  do  not  encounter  the 
temporal  discontinuity  at  t  =  0.  Therefore 

3 

G(t,x)=  I  HBm  exp  [jco2  m(t-x)  ]  (4.3.13) 

m  =  1 

where 

HBm  - - - - ' -  (4.3.14) 

11  <“2m-®2p) 

p=  1,  p*  m 


Explicit  expressions  for  G  (t,  x )  valid  for  various  regions  of  ( t,  x )  plane  are  given  in 
( 4.3.15)  on  the  next  page  . 

4.4  Perturbation  Technique 


Proceeding  on  the  lines  of  [18  ]  for  the  isotropic  case  we  can  show  that  the  n  th  order  correction 
term  hn  may  be  obtained  by  solving  ( 4.4.1  ): 


dhr 

dt3 


d2hn  9  p  ,  ,1  dhn  .  ^  o 

+  [ni  +  “p  (*)]  "cTT  -  j  ni  05 1  “bhn 


9  d  hn  i 
-  (t)  "-1 


dt 


(4.4.1  ) 


where 
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(4.4.2a) 


h-  h0  +  h1  +  h2  + - hn  +  -  -  - 


and 


Acop(t)  =“p(t)  “©pH)  (4.4.2b) 


The  first  order  correction  term  may  then  be  written  as 


hi  (t)  =  j  dx  [-  A  C0p  ( x )]  h  o  ( x )  G  ( t ,  x)  (4.4.3) 


Since 


h0(t) 

~rn~ 


=  exp  [jco1 1]  ,t<  0  (4.4.4a) 


h0(t) 


H -  =  X  S2neXP  U^2nX]’X>  0  (4.4.4b) 


n  =  1 


for  a  finite  rise  time  ( see  FIG.  3b ) 


'i(t)  r° 

L; 


dx  [  -  A  o)p  (x )  Jjrn,  exp[jto,T] 


“  sJ 

X  exP  [“  j  ^Mm1]  X  S2nim  exP  j  C02n1mt] 


[m  =  1 

n  =  1 

•  Tr2 

+ 

• 

d  x  [  -  A  cop  ( x )  ] 

0 

3 

3 

Zjw2nS2n  exp  [  jC02nX] 

X  HBm  exp  [  jco2m(t-x)] 

n  =  1 

m  =  1 

(  4.4.5) 
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4.5  Sample  Calculation  for  a  Linear  Profile 


A  linear  profile  for  cop2  ( t )  function  is  assumed  .  The  computations  for  the  illustrative  example  are 
simplified  by  choosing  Tr-|=  0  and  Tr2  =  Tr.  This  will  entail  some  loss  of  accuracy  if  the  exponential 
functions  exp  (x  )  are  approximated  by  only  two  terms  ( 1+x  )  in  (  4.4.5  ).  However  for  a  linear 
profile  the  integration  can  be  carried  out  analytically  without  making  such  an  approximation  and  the 
loss  of  accuracy  is  thus  avoided.  For  the  linear  profile  with  Tri  =  0 


A  C0f>  ( t )  =  -  ( ®p2  “  cOp-i  )  ( i  -  V"rr ) 


(4.5.1) 


and  from  (4.4.5) 


mu 

H0 


dX  (  C0p2~C0pl  )  ( 1  ~  VTr) 


Zi®2nS2n  exp  [  j®2nX] 


X  HBm  exp  [  j  C02m ( t - t  )  ] 


(4.5.2) 


For  the  R  wave  the  left  side  of  ( 4.5.2)  may  be  expressed  as 


Mt) 


=  A H BN exp  (jco21t)+  HBP1  exp  ( j co22 1 )  +  HBP2exp  (jco23t) 


(4.5.3) 


where  ©21  is  taken  as  negative  (  R2  wave) ,  ©22  positive  (R1  wave ),  and  ©23  positive  (  R3  wave 
)  and  less  than  ©22-  Thus  A  Hbn  is  the  correction  to  the  amplitude  of  the  magnetic  field  of  the  R2 
wave  due  to  the  finite  rise  time  Tr.  Equating  the  coefficients  of  exp  (  ©2n  t )  on  the  left  hand  side 
and  the  right  hand  side  of  ( 4.5.2)  we  get  an  expression  for  A  Hbn 


H„l 


ahbn=  ( «°p2—  “pi )  hbi  dx  (1  ~yTr)exp(  —  j  CO21  X)  X  j  ®2nS2n  exp  [  jco2nx] 


(4.5.4) 


Evaluating  this  definite  integral  on  the  right  side  we  get 
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f 


AHBN-  (03p2”03pl  )  HB1 


S2i  j  co21  Tr 


co 


22 


23 


22 _ j  fl>22  ^XP  [  j  (  ®22  —  ®21  )  r  ~  ^  ] 


C022  ©21 


TT 


(O22  C02  *| 


)J 


C°23  _  jC023exP  [j(w23-C022  )Tr-1  1 

2 


®23  “21 


T,  (' 


^23  ®21j 


(4.5.5) 

The  normalized  magnetic  field  ( including  the  correction  term  of  the  R2  wave  as  a  function  of  Tr  is 
given  by 


HR2(Tr)=  HB1  +  AHBN(Tr) 


(4.5.6) 


and  the  power  reflection  coefficient  p  of  the  R2  wave  is  given  by 


Re(eR2hR2) 
Re(e1  h*  ) 


CO 


HR2 


2 


(4.5.7) 


Figure  13  shows  p  vs.  Tr.  for  various  values  of  the  gyrofrequency.  Here  copi  is  assumed  to  be 
zero.  Since  the  medium  1  is  free  space  the  source  frequency  coi  is  designated  as  coo  and  the 
asymptotic  plasma  frequency  in  the  second  medium  coP2  is  designated  as  cop.  All  frequencies 
are  normalized  by  assuming  fo  =  coo  /  2jc  =  1 .  Thus  Tr  is  normalized  with  respect  to  the  period  of 
the  source  wave.  The  plasma  frequency  fp  =  cop/27r  is  taken  to  be  1 .2. 


The  solid  line  curve,  which  has  the  parameter  fb  =  0,  is  the  graph  for  the  isotropic  case. 
Comparison  of  this  curve  with  the  results  obtained  from  the  exact  solution  for  the  isotropic  case  for 
a  linear  profiled  8]  shows  that  the  perturbed  solution,  obtained  here,  is  valid  for  Tr  <  0.2.  For  larger 
values  of  Tr,  the  solution  can  of  course  be  improved  by  taking  more  terms  in  the  perturbation 
series.  For  very  large  values  of  Tr  (  slow  switching  ),  the  perturbation  solution  must  be  based  on  a 
different  Green’s  function.  Such  a  Green's  function  has  to  be  developed  by  using  a  different 
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reference  profile  that  is  appropriate  to  slow  switching.  A  starting  point  for  such  a  development  is 
the  WKB  solution  [14],  [15]. 

5.  Application  to  Lightning  Induced  Ionospheric  Effects 

5.1  Introduction 

Lightning  induced  effects  cause  a  temporary  enhancement  of  ionization  [  21],  Such  transient 
plasmas  have  a  time  -  varying  plasma  frequency  cop  ( t )  with  a  rise  time  Tr.  There  is  considerable 

interest  in  the  ionospheric  community  to  investigate  the  recently  discovered  '  Sprites  ' 
phenomena  [  21]  -  [23]  which  are  red  emissions  in  the  lower  D  region  induced  by  the  lightning 
discharges  from  a  cloud  to  the  ground.  The  preliminary  indications  are  that  the '  Sprite  ’  is  a  plasma 
with  electron  density  enhanced  by  about  a  10  to  103  /  cc.  The  rise  time  is  about  100 
microseconds.  However  the  bulk  of  the  ionization  may  be  taking  place  in  a  shorter  time  duration  ( 
see  Figure  3  )  and  the  effective  rise  time  may  be  much  less  than  100  microseconds.  The 
evolution  of  the  electron  density  in  the  '  Sprite  '  can  be  diagnosed  by  tracking  from  the  ground  the 
instantaneous  frequency  of  the  upshifted  reflected  wave  with  a  suitable  instrument  [24],  If  the 
source  frequency  is  very  high  (  as  in  the  case  of  incoherent  scatter  radar  ),  the  reflection 
coefficient  is  negligible  and  the  wave  can  not  be  detected  by  the  most  sensitive  receiver.  A 
source  wave  in  VLF  is  suitable  and  in  this  case  the  rise  time  is  comparable  to  the  period  of  the 
source  wave. 

5.2  Illustrative  example 

The  data  for  the  illustrative  example  is  chosen  so  as  to  be  relevant  in  principle  to  VLF  diagnostics 
of  the  time  evolution  of  the  electron  density  induced  by  the  lightning.  The  source  frequency  fo  is 
chosen  to  be  25  kHZ  in  the  VLF  range.  The  plasma  frequency  is  assumed  to  increase  from  fp-|  =  0 
to  fp2  Hz  in  a  rise  time  of  Tr  seconds.  Figure  14  shows  a  three  dimensional  picture  of  the  square 
of  the  reflection  coefficient  magnitude  ( |  R  |2  )  obtained  by  an  exact  solution  for  a  linear  profile  of 
the  electron  density  [18]  with  a  rise  time  of  Tr  seconds.  The  graph  shows  that  the  reflected  wave  is 
of  sufficient  strength  for  a  VLF  source  wave  and  can  be  detected  by  a  suitable  receiver. 

A  question  whether  the  sprite  is  a  plasma  or  whether  the  red  emissions  are  caused  by  processes 
other  than  ionization  may  be  answered  by  diagnosing  the  B  waves  or  pulses 
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0.00003 


FIG.  14  Isotropic  Plasma:  3-D  graph  of  the  square  or  the  reflection 
coefficient  amplitude  for  a  linear  profile  obtained  through 
the  exact  solution.  The  independent  variables  are:  rise  time 
( Tr )  in  sec.  and  plasma  frequency  ( top )  in  rad  /  sec. 
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5.3  Other  Applications 


In  addition  to  the  'Sprites',  observation  of  several  other  lightning  induced  geophysical 
phenomena  like  'Blue  Jets',  'Elves'  are  reported  in  the  literature.  Also  ionospheric  disturbances 
in  E  and  F  layers  require  the  consideration  of  dynamic  changes  in  the  parameters  of  a  magnetized 
plasma..  Alteration  of  the  pulse  duration  in  an  anisotropic  media  discussed  in  section  3  may  be  a 
good  diagnostic  measurement  for  these  time  -  varying  plasmas.  These  aspects  are  being 
explored  in  active  collaboration  with  my  focal  point . 

6.  Conclusion 

Using  one  dimensional  theory,  several  aspects  of  electromagnetic  wave  transformation  in  a 
dynamic  magnetized  plasma  are  considered.  They  may  be  grouped  under  the  following 
categories 

1 .  Effects  of  simultaneous  variation  of  the  electron  density  in  space  and  time 

2.  Alteration  of  pulse  duration  in  a  dynamic  magnetized  plasma  medium 

3.  Green's  function  and  perturbation  technique  for  a  switched  time  -  varying  magnetoplasma 
medium 

4.  Application  of  the  theoretical  results  developed  in  1  -  3  to  the  investigation  of  lightning 
induced  ionospheric  effects.  This  effort  will  continue  in  active  collaboration  with  my  focal 
point . 

One  dimensional  models  have  shown  that  the  dynamic  changes  of  the  parameters  of  a 
magnetoplasma  have  a  remarkable  effect  on  a  source  wave.  The  next  step  is  to  investigate  in 
more  depth  the  realistic  three  dimensional  problems  with  the  help  of  Finite  Difference  Time 
Domain  Numerical  Technique. 
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THERMAL  ANALYSIS  FOR  THE  APPLICATION  OF  HIGH  POWER  LASERS  IN 
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and 
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University  of  Central  Florida 

Abstract 

The  development  of  high  power  Chemical  Oxygen-Iodine  Laser  (COIL)  by  US  Air  Force  Phillips 
Laboratory  makes  the  large-area  materials  processing  possible.  The  flexible  transmission  of  COIL  beam 
by  optical  fibers  enhances  its  application.  One  of  the  applications  of  COIL  large-area  materials  processing 
is  paint  removal  from  aircraft,  which  is  considered  as  a  most  potential  technique  to  replace  the  expensive 
and  waste-generated  conventional  chemical  and  mechanical  methods.  In  order  to  understand  the  new 
technique  better,  a  mathematical  modeling  of  paint  removal  by  COIL  beam  from  aircraft  is  developed.  The 
cross-sectioned  shape  of  paint  after  laser  irradiation  and  the  temperature  of  aircraft  body  surface  are  calculated 
by  the  modeling.  The  effect  of  laser  parameters  on  the  paint  removal  process  is  also  discussed.  An 
optimum  laser  power  at  which  the  relative  more  paint  is  removed  and  the  surface  temperature  is  not  so  high 
is  found  and  it  is  affected  by  other  parameters  but  is  not  relative  to  laser  modes.  Based  on  the  results 
derived  from  the  modeling,  uniform  distribution  of  power  density  is  suggested  for  laser  large-area  materials 
processing. 
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THERMAL  ANALYSIS  FOR  THE  APPLICATION  OF  HIGH  POWER  LASERS  IN 
LARGE-AREA  MATERIALS  PROCESSING 

Jian  Xie  and  Aravinda  Kar 

Introduction 

One  of  the  applications  of  laser  large-area  materials  processing  is  laser  paint  removal  from  aircraft.  All 
aircraft  with  painted  surfaces  must  be  stripped  and  repainted  on  an  average  four-year  cycle  [1],  Chemical 
stripping  and  mechanical  abrasion  are  most  often  used  for  paint  removal  from  aircraft  [1,2].  However,  the 
use  of  toxic  chemicals  and  considerable  hand  scraping  result  in  a  large  expenditure  of  personnel  hours  and 
the  creation  of  serious  air  and  water  pollution.  Paint  removal  is  one  of  the  seven  most  costly  elements  cf 
maintaining  US  Naval  aircraft  [3].  The  resulting  waste  disposal  problem  has  become  increasingly  important 
in  recent  years  [1-3]. 

Using  lasers  to  remove  surface  coatings  from  aircraft  has  been  a  promising  technique  since  the  early  1980s. 
Cost  was  the  first  drive  force  for  developing  the  technology.  Dramatic  cost  saving  will  be  realized  while 
providing  enhanced  safety  for  maintenance  personnel  [3].  A  survey  of  all  potential  paint  removal  processes 
in  1986  revealed  that  laser  technique  had  more  potential  saving  than  any  other  processes.  Laser  paint 
removal  eliminates  the  environmental  hazards  associated  with  chemical  methods,  reduces  waste  removal  to 
only  paint  residue,  and  allows  removal  from  all  types  of  aircraft  substrates.  Waste  can  be  minimized  to 
0.5%  of  that  generated  by  present  chemical  stripping  techniques,  and  contained  and  concentrated  in  an 
environmentally  acceptable  form.  In  addition,  laser  paint  removal  will  avoid  problems  associated  with  the 
effects  of  incompletely  removed  or  neutralized  corrosive  substances  remaining  in  aircraft  faying  surfaces  or 
seams  after  chemical  removal  [1].  Non-laser  automated  removal  methods  such  as  aqua  stripping,  dry  ice 
crystals,  granulated  plastic  and  abrasive  wheat  germ  pellets  may  cause  surface  deformation,  surface 
hardening,  and  other  intrusive  and  impact-related  problems  [1].  Another  reason  to  use  laser  paint  removal 
is  that  the  removal  of  new  high-performance  coatings  by  chemicals  (  usually  methylene  chloride)  becomes 
increasingly  difficult.  Some  of  new  coatings  are  resistant  to  removing  and  you  can  begin  to  realize  the 
dilemma  now  facing  the  aerospace  industry  [2,4,5].  This  technique,  as  a  secondary  consideration,  may 
subsequently  be  applied  to  public  transit  vehicles  including  trains,  buses,  and  subway  cars  and  extend  to 
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highway  bridges,  fuel  and  water  tanks,  pressure  containers,  industrial  steel  buildings  and  commercial  ship 
hulls.  The  commercial  potential  for  lasers  and  other  pulse-power  methods  of  coating  removal  is 
considerable.  It  would  resulting  in  a  savings  of  approximately  $3  billion  per  year  amount  relative  to 
conventional  sand  blasting  [2].  U.S.  Navy  awarded  a  contract  to  build  two  automated  laser  paint  stripping 
(ALPS)  systems  for  the  stripping  of  F-14  Fighter  and  H-3,  H46,  and  H53  helicopters  [1], 

Both  pulsed  [1-3, 5, 6]  and  CW[2,3]  lasers  are  used  for  paint  removal.  C02,  Nd:YAG  and  Excimer  lasers  are 
major  types  of  laser  for  the  application  [1 -3,5,6].  Young  et  al.  compared  the  performance  of  coating  removal 
with  several  high  power  techniques  including  pulsed  and  CW,  C02  and  Nd:YAG  lasers,  and  electron 
beam.  The  results  show  that  CW  Nd:YAG,  pulsed  Nd:YAG  and  C02  lasers  have  good  performance  in 
removal  efficiency  [2].  Raiber  et  al.  investigated  the  effect  of  energy  density,  pulse  frequency  as  well  as 
other  laser  parameters  on  ablating  process  of  paint  from  aerospace  materials  with  a  XeCl-laser  [6]. 
Schweizer  and  Werner  introduced  an  industrial  2  kW  TEA  C02  laser  system  for  paint  removal  from  aircraft 
[5].  The  description  of  other  laser  depaint  systems  can  be  found  in  Refs.  1,  3  and  7.  Hickey  and  Hise 
developed  a  color  image  processing  and  vision  system  for  an  automated  laser  paint  stripping  system,  which 
could  discriminate  between  substrates  of  various  colors  and  textures  and  paints  ranging  from  semi-gloss 
grays  to  high  gloss  red,  white  and  blue  [7].  Stubbier  et  al.  investigated  the  possibility  of  Excimer  laser 
removal  of  hard  coatings  such  as  TiN  or  WC  from  tools  [8].  Li  et  al.  demonstrated  the  technical  feasibility 
and  basic  phenomena  of  laser  removal  of  embedded  contamination  down  to  depths  of  0.1-4  mm  thick  in 
construction  materials  such  as  concrete,  brick,  plaster/morter,  stones  and  stainless/carbon  steels  [9]. 
Pantelakis  etal.  tested  the  mechanical  behavior  of  2024  Al  alloy  subjected  to  paint  stripping  by  Excimer, 
C02,  Nd:YAG  laser  radiation  and  plasma  etching  [10].  Their  results  show  that  there  is  no  significant 
degradation  in  yield  and  ultimate  tensile  strength,  but  in  tensile  ductility  and  toughness.  The  highest 
degradation  results  from  the  use  of  the  plasma  etching  and  Excimer  laser,  and  the  lowest  reduction  from 
Nd:YAG  and  C02  lasers.  There  is  also  a  considerable  extension  in  fatigue  life. 

To  make  the  technique  possible  and  improve  the  productivity,  the  high  power  laser  is  required  [1,5],  The 
use  of  C02,  Nd:YAG  and  Excimer  lasers  for  this  application  has  been  reported  [1 -3,5,6],  The  chemical 
oxygen-iodine  laser  (COIL)  may  be  another  good  choice  for  paint  removal  because  it  has  a  short  wavelength 
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(A,=1.315  jim)  close  to  Nd:YAG  laser  (X=1.06  pm)  and  higher  power  than  Nd:YAG  laser.  COIL  can  be 

transmitted  by  optical  fibers  instead  of  the  mirror  and  tube  system  for  C02  laser.  Unfortunately,  no 
application  of  COIL  in  paint  removal  was  reported. 

The  chemical  oxygen-iodine  laser  (COIL)  is  still  under  development  and  the  Phillips  Laboratory  at  the 
Kirtland  Air  Force  Base  is  carrying  out  pioneering  research  in  this  area.  McDermott,  et  al  invented  the 
COIL  in  1977  at  the  Air  Force  Weapons  Laboratory  which  is  now  a  part  of  the  Phillips  Laboratory  [11].  It 
is  the  shortest  wavelength  (X=  1.315  mm),  high  power  chemical  laser  ever  developed.  Truesdell,  et  al 

demonstrated  the  engineering  scalability  of  a  supersonic  gas  flow  COIL  with  the  39  kW  RotoCOIL  device 
at  the  U.  S.  Air  Force's  Phillips  Laboratory  in  1989  [12].  This  device  used  compact,  rotating-disk, 
wetted-wall  chemical  reactors  to  generate  large  quantities  of  singlet  delta  oxygen,  and  then  mixed  the 
oxygen  with  iodine  using  a  stacked  supersonic  nozzle  array  [13].  Fujii,  et  al  [14,15]  and  Wani,  et  al  [16] 
at  Kawasaki  Heavy  Industries,  Ltd.,  in  Japan  demonstrated  the  operation  of  a  COIL  for  long  duration  with 
the  operation  of  a  1  kW  subsonic  gas  flow  COIL  for  2  hours.  At  the  Phillips  Laboratory,  sustained  high 
power  operation  of  a  supersonic  COIL  was  demonstrated  with  operation  at  9  kW  for  2  minutes  [17]  and  at 
430  W  for  13  minutes  on  a  compact  COIL  test  stand  [18].  Further  details  on  COIL  development  and  on 
the  theoretical  operation  of  COIL  can  be  found  in  Truesdell,  et  al  [12]  and  Avizonis  and  Truesdell  [19], 
respectively.  Fujioka  reviewed  the  transmission  of  a  COIL  beam  through  optical  fibers  in  1991  to  examine 
the  industrial  applications  of  such  beam  delivery  systems  [20]. 

Some  work  on  further  application  of  the  COIL  has  been  carried  out  recently  in  Phillips  Laboratory.  Under 
the  AFOSR  1995  and  1996  summer  research  program,  Dr.  A.  Kar,  from  Center  for  Research  and  Education 
in  Optics  and  Lasers  in  University  of  Central  Florida,  worked  with  Dr.  J.  E.  Scott  and  Dr.  W.  P.  Latham 
from  Phillips  Laboratory.  During  these  time,  an  extensive  theoretical  and  experimental  research  were 
carried  out  to  understand  some  of  the  basic  principles  of  thick  section  cutting  with  multimode  and 
rectangular  COIL  beams.  Two  mathematical  models  were  developed.  One  of  them  is  based  on  a  simple 
energy  balance  to  account  for  the  changes  in  average  temperature  of  the  substrate,  which  is  classified  as  the 
average  temperature  model.  The  second  model  is  based  on  a  more  accurate  energy  balance  to  account  fcr 
the  pointwise  variation  of  temperature  in  the  substrate,  which  is  classified  as  the  heat  conduction  model.  In 
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the  experimental  studies,  various  process  parameters  such  as  the  laser  power,  assist  gas  nozzle  size  and  the 
assist  gas  pressure  were  varied.  Some  of  the  results  of  these  studies  have  been  already  presented  in  the  form 
of  a  poster  in  a  workshop,  and  the  other  data  have  been  published  in  Journal  of  Applied  Physics  and 
Journal  of  Laser  Applications  and  International  Conferences.  These  results  are  also  summarized  in  reports 
that  have  been  submitted  to  the  Research  and  Development  Laboratory  (RDL)  of  AFOSR  program  office. 

Although  the  laser  has  been  consider  as  a  most  potential  technique  for  paint  removal,  the  current  state  of  the 
technology  indicates  that  they  are  a  decade  or  more  off  in  the  future  [4],  The  proper  understanding  for  the 
laser  paint  removal  mechanism  is  essential  and  important.  In  this  study,  a  mathematical  model  of  COIL 
paint  removal  from  aircraft  body  (aluminum  alloy)  was  developed.  The  mechanism  of  laser  paint  removal 
could  be  understood  and  the  effect  of  laser  modes  and  parameters  on  paint  removal  process  could  be 
determined  by  this  model.  The  temperature  rise  on  aircraft  surface  is  also  calculated,  which  is  particularly 
important  because  we  wish  to  remove  the  paint  without  damaging  the  aircraft  body. 

Mathematical  Model 

The  beam  that  comes  out  of  a  typical  COIL  cavity  is  rectangular  in  shape,  and  is  consisted  of  several 

Laser 

beajy,  Laser  beam  scanning 

velocity,  v 

I  - O 


z 

FIG.  1.  Laser  beam  geometry  and  coordinate  system  for  the  mathematical  model. 
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modes.  The  heating  characteristics  of  a  multimode  beam  is  different  from  that  of  a  Gaussian  mode  (TEMoo) 
beam.  The  geometric  configuration  of  COIL  paint  removal  is  showed  in  Fig.  1 .  A  rectangular  laser  beam 
with  a  half  width  of  wi  and  a  half  length  h  moves  in  a  velocity  of  v  in  x  direction.  The  paint  under  the 
laser  irradiation  is  heated  and  removed  by  vaporizing. 


From  physical  point  of  view,  the  laser  paint  removal  is  a  process  of  melting  and  vaporizing  of  paints  [2,3 
5,6,9].  To  make  the  problem  clearer,  we  first  consider  the  one-dimensional  melting  and  vaporizing  of  paint 
under  laser  irradiation  with  a  power  density  I  as  shown  in  Fig.  2.  ft  and  ft  are  vaporizing  depth  and 
melting  depth,  respectively,  which  change  with  laser-paint  interaction  time  t.  The  arrows  in  Fig.  2 
indicate  the  heat  flux  in  or  out  at  boundaries.  The  substrate  is  aircraft  body  made  of  aluminum  or 
aluminum  alloy.  I„t is  the  heat  conduction  loss  by  substrate. 


FIG.  2.  Heat  flux  in  paint  melting  and  vaporizing  process  at  a  site  with  laser  intensity  I 


The  phase  change  occurs  in  the  liquid-vapor  and  solid-liquid  interfaces  and  can  be  described  by  Stefan 
condition  as  follows 


Ull, 

p Lv~dT~I  qLV 
T  dH 2_  , 

P  An  ^  ^  Lm  Lm 


at  Z=ft 


at  Z=ft 


(1) 

(2) 
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where  Lm  and  Ly  are  the  latent  heats  of  melting  and  vaporizing,  respectively.  In  quasi-steady  state  of  paint 
melting  and  vaporizing,  the  liquid-vapor  interface  is  moving  down  in  a  same  speed  as  solid-liquid  interface 
[21],  e.g., 


dH ,  _  dH2 


(3) 


dt  dt 

In  this  case,  the  temperature  distribution  in  liquid  layer  is  almost  linear  and  the  heat  transferred  into  the 
liquid  (q'u,)  layer  should  be  same  as  the  heat  transferred  out  the  liquid  layer  (<7^) 

Q  Lm~  Qlv  (4) 

Rearranging  the  Eqs.  (1-2)  based  on  Eqs.  (3-4),  we  obtain 


„  _  +  LvqLn 

qiV~  Lm+Ly 


(5) 


In  the  practical  paint  removal  process  in  Fig.  1  (quasi-steady-state  two-dimensional  problem),  the  above 
equations  are  still  available  by  changing  the  interaction  time  t  to  locations  in  x  axis 
/,  -X 


(6) 


The  heat  flux  quu  can  be  taken  as  because  the  energy  used  to  rise  the  temperature  of  the  solid  paint  is 
negligible  because  it  is  small  compared  to  the  large  amount  of  latent  heats  of  melting  and  vaporizing. 
Considering  the  effect  of  the  different  interaction  time  at  different  locations,  we  can  assume 

l-x 


<h»,  =  l 


sub 


21 


(7) 


Since  the  linear  temperature  distribution  in  liquid  layer  in  quasi-steady  state,  the  heat  flux  qLV  can  be 
expressed  by 


T  -T 

Qlv  =  k,  —r  m 


h2-hx 


(8) 


The  thickness  of  liquid  layer  can  be  derived  from  Eqs.  (5),  (7)  and  (8) 

k,(L,+LJ(Tr-TJ 


Substituting  Eqs.  (5-7)  into  Eq.  (1)  and  integrating  it,  the  vaporizing  depth  is  given  by 


(9) 
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«, =  .v,  ,j oo) 

pv(Ly  +  LJJx  2 7, 

We  need  to  know  the  intensity  distribution  of  COIL  beam  for  the  calculating  of  vaporizing  depth.  The 
laser  intensity,  /,  is  given  by 

M  N 

/=££/»  oo 


where  M  and  N  are  the  largest  mode  number  of  the  beam  in  the  x  and  y  directions,  respectively.  Imn  is  the 
intensity  of  the  rectangular  beam  of  m-th  and  n- th  order  modes,  which  is  given  by 


=  hK 


ex\T«r\< 


where  I0  = 


n2m*nm!n!ll0wl0 


and  Pmn  is  the  total  power  of  the  laser  beam  of  m-th  and  n-th  order  modes,  ho  and  wl0  are,  respectively,  the  half- 
length  and  half-width  of  the  Gaussian  mode.  If  h  and  wt  are,  respectively,  the  half-length  and  half-width  of  the 
spot  for  the  beam  with  TEMmn  mode,  lt  and  w;  are  related  to  Gaussian  spot  size  by  the  expressions, 

lj  =  /10V2wj~hl  and  w,  =  Wl0v2rt  + 1.  A  is  the  absorptivity  of  the  workpiece.  Hm  and  H„  are, 
respectively,  m-th  and  n-th  degree  Hermite  polynomials  for  the  m-th  and  n-th  order  modes  of  the  laser  beam. 
An  (m+  7)-th  degree  Hermite  polynomial  in  x  can  be  written  as 

HmX(x)  =  2xHm(x)-2rnHm_i(x)  (14) 

where  H0(x)  =  1  and  Hi(x)  =  2x. 


The  heat  flux  reached  to  substrate  surface,  Imb,  has  similar  intensity  distribution  with  I  and  it  is  the 
difference  between  original  heat  input  by  laser  beam  and  the  heat  dissipated  by  melting,  vaporizing  and 


heating  of  paint,  so 


7  —  //  )  r/2  ^2x  jj 2  “'fey 

1  sub  ~  (i  sub/O11  m  ,  11  n 


2 y2 

exp - - 

l  W,  n 
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Where  (Isub)o=I0 


(16) 


1  — 


p[Ly  +  Lm  +  cP(Ty  T0) 


dH , 
dt 


aYLp- 

J  —  _ m  n _ 

(2l,)(2w,) /^(2m+  J)(2n  +  T) 

The  (dHi/dt)  can  be  represented  approximately  as  (/// r),  where  the  interaction  time 


V 


(17) 


(18) 


Therefore,  the  heat  flux  reached  to  substrate  surface  can  be  represented  by  both  Eq.  (15)  and  the  following 
equation 


(I sub) o  ~  h 


1- 


P[Ly  +  Lm  +  Cp  (Ty  TJ] 

A-J(2m+\)(2n  +  vZS'- 

m  n 


(IwJvH 


(19) 


So  far,  the  distribution  of  laser  intensity  I  and  heat  flux  in  substrate  surface  I,Uh  have  been  obtained  and  the 
vaporizing  and  melting  depths  can  be  calculated  numerically  from  Eqs.  (9,10). 


During  the  laser  irradiation,  a  portion  of  the  laser  energy  is  dissipated  by  paint  removal  and  the  rest  is 
conducted  to  substrate  (aluminum  alloy)  which  causes  the  temperature  rise  of  substrate.  The  surface 
temperature  is  what  we  pay  more  attention  to  because  we  wish  to  remove  the  paints  completely  without 
damaging  the  substrate  (no  melting  occurs).  The  temperature  distribution  in  substrate  is  calculated  based 
on  a  3-dimensional  heat  conduction  model  subjected  to  the  heat  flux  as  follows. 


The  governing  equation  for  the  temperature  distribution  in  substrate  is  given  by 


a 


( d2T  d2T  d2T 


dx2  dy 2  dz2 


dT 

dx 


(20) 


for  — oo  <x  <°°,  —  °°  <  y  <  °°,  and  0  <  Z  <  oo.  Here  a  is  the  thermal  diffusivity  of  the  workpiece, 
which  is  defined  by  a  =kmi/(pc),  where  kmll  and  c  are  the  thermal  conductivity  and  specific  heat  of  the 
substrate  respectively,  v  is  the  velocity  of  the  laser  beam  in  the  x  direction.  Eq.  (20)  is  written  in  a 


7-10 


moving  Cartesian  coordinate  system  whose  origin  lies  at  the  center  of  the  beam.  The  temperature  T,  which 
represents  T(x,y,z),  in  Eq.  (20)  is  the  dimensionless  temperature  of  the  workpiece,  and  is  defined  by 
T  -T 

T  =  — 1 — (2D 
T.-T, 

where  7),  T0  are  the  actual  and  initial  temperature  of  the  substrate  respectively.  Tm  is  the  melting  point  of 


paints.  The  boundary  conditions  for  Eq.  (20)  are  given  by 
T  0  as  x  — »  ±°o 
T  — »  0  as  y  — »  ±°° 

T  — >  0  as  z  — >  °° 


(22a,  b) 
(22c, d) 


ar____L_ 
a»  r„-rc 


■/„,(.  at  z  =  0 


let  us  define 


T  =  T2  exp(-bx),  where  b=  — 

2a 

Substituting  Eq.  (23)  into  Eq.  (20),  we  obtain 


3x2  3y2  3 z1 


The  boundary  conditions  for  T2  are  same  as  those  given  by  the  conditions  (22a-e).  However,  due  to  Eq. 
(23),  boundary  condition  (22f)  takes  the  following  form: 

^  =  -11  anJm  ( x)gn( y)  at  z  =  0  (25) 


amn  = 

mn 


sub)  0 

k(T.-T,) 


f.(*)  =  Hm  —  \exdbx-  — 


lil v)  {  2 y’ 

g„(y)  =  H„  —  exp - - 

^  ji  1  ii,*- 
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Using  the  following  Fourier  transform  pair 

OO  OO 

Integral  transform:  T2(p,q,z)=  J  dx  exp(ipx)  J  dy  exp(iqy)T2(x  ,y  ,z)  (27a) 

— OO  — OO 

|  OO  OO 

Inversion  formula:  T2(x,y,z)  =  —  J dpexp(-ipx)  J dq exp(-iqy)T2( p,q,z)  (27b) 

the  solution  of  Eq.  (24)  that  satisfies  the  boundary  condition  (25)  and  the  conditions  as 
X  — >  ±°°,  y  — >  ±oo,  artdz  — >  00 ,  can  be  written  as 


1  M  N  - 

T2(x,y,z)  =  —Yj  S  °mn  J  dp  exp(-ipx)fm  ( p) 


An 


m=0  n=  0 


~  _  expl- 

1  do  exn(  iov )  o  (n )  ' 

\Jp2  +  q2  +  b2zj 

J  /£«(  H/  r 

v2  +q2  +  b 2 

(28) 


where  fm(p)  ar*d  gn  (q)  are  Fourier  transforms  of  fm(x)  and  gn  (y),  respectively,  which  are  given  by 

OO 

fjp)=jdxexp(  ipx)fm  ( x)  (29a) 


gn(q)=\dy  exp(  iqy)g„(y) 


Noting  that  the  Fourier  inverse  transform  (F1)  of 


(29b) 


exp 

^  p2  +q2  +  b2  zj 

- 

Tp 

2+q2+b2 

is  given  by 


F' 


exp{^ 

^p2  +q2  +b2  z  j 

Tp 

2  +q2  +  b 2 

1  °° 

=  7 T  jdpexp(-ipx) 


00 

J  dq  exp(-iqy)gn(q) 
j  exp{-b^jr2  +z2  j 

4r2~-i-V2 


e*p[ 

Ip2  +q2  +b2  z j 

- 

T 

<2  +q2  +  b2 

(30) 


,  where  r  =  ^jx2  +  y' 
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and  applying  the  convolution  theorem  to  Eq.  (28),  we  can  obtain  an  expression  for  T2(x,y,z)  which  can  be 
substituted  into  Eq.  (23)  to  obtain  the  following  expression  for  the  dimensionless  temperature  T(x,y,z). 

T(x,y,z)  =  EE(Je1  2)  X  aSjr'dr'/Jx-r'cosB) 

Tt  m=0  n=0  o 

n 

jdBg/y-r'sinB) 

0 

The  integrals  in  Eq.  (31)  are  numerically  evaluated  by  using  the  trapezoidal  rule  for  numerical  integration 
to  calculate  the  temperature  distribution  for  various  process  parameters. 

We  need  to  know  the  thermophysical  properties  of  paints  and  substrate  (aluminum)  for  numerical 
calculating.  Unfortunately,  few  data  are  found  by  searching  the  relative  literature  and  by  contacting  some 
paint  manufacturers  and  research  centers  of  materials  properties  in  Purdue  University  and  Illinois  Institute  cf 
Technology.  The  basic  components  of  a  paint  consist  of  film-forming  substances,  resins,  plasticizers  and 
pigments  [22].  The  polymer  film  formers  can  either  be  macromolecular  products  or  low  molecular  mass 
compounds  that  react  to  form  macromolecules  on  curing.  Increasing  molecular  mass  of  the  polymer  film 
formers  results  in  improved  mechanical  properties  of  the  film  until  a  limiting  values  is  reached.  The 
polymer  film  former  of  the  first  group  are  represented  by  cellulose  nitrate  or  vinyl  chloride  co-polymers. 
The  second  group  includes  polyurethanes  or  epoxy  resins.  Too  much  resin  produces  hard  and  brittle 
coatings  whereas  to  much  platicizer  produces  soft  films.  Since  only  small  amounts  of  resins  and 
plasticizers  are  used,  they  also  may  be  classified  as  paint  additives.  Therefore,  the  physical  properties  of 
paint  is  mainly  determined  by  the  polymer  film.  Since  it  is  lack  of  thermophysical  properties  of  paints,  we 
may  use  the  data  of  polymers  as  an  estimation. 

However,  it  is  also  difficult  to  find  all  of  the  thermophysical  properties  of  a  polymer.  Fortunately,  the 
thermophysical  properties  of  almost  all  polymers  are  close,  so  we  can  choose  the  average  values  of  most 
polymers.  The  thermophysical  properties  of  polymers  are  collected  in  Appendix.  The  thermophysical 
properties  of  aluminum  [23,24]  and  polymers  [see  Appendix]  used  for  numerical  calculation  are  listed  in 
Table  1. 
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Table  1  Thermophysical  properties  of  polymers  (Appendix)  and  aluminum  [23,241 


Polymers 

Aluminum 

Density  p  (kg/mJ) 

1200 

2700 

Specific  heat  cp  (xlO3  J/kg-K) 

2.0 

0.899 

Latent  heat  of  melting  (xlO5  J/kg) 

0.65 

0.97 

Latent  heat  of  vaporization  (xlO5  J/kg) 

2.72 

112.97 

Melting  temperature  (K) 

473 

933 

Thermal  decomposition  temperature  (K) 

593 

Thermal  conductivity  (W/mK) 

0.2 

92 

Thermal  diffusivity  (xlO-4  m:/s) 

0.1 

38 

Results  and  Discussion 


The  vaporizing  depth  of  paint  and  temperature  of  substrate  surface  are  calculated  numerically  by  Eqs.  (10) 
and  (31)  with  different  process  parameters  and  laser  modes.  The  absorptivity  is  quite  high  for  paint  [25]  and 
is  taken  to  be  80%  in  this  study.  The  paint  thickness  of  1  mm  is  chosen  in  the  calculation. 


The  power  density  distribution  of  TEM3t  COIL  beam  with  a  10  mm  in  width  and  length  is  shown  in 
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Fig.  3.  It  should  be  noted  that  M=  3  and  N=\  correspond  to  m= 0, 1,2,3  and  «=0,1  respectively.  For  the 
beam  having  m= 0, 1,2,3  and  n=0,l,  it  means  that  the  beam  has  4  and  2  modes  in  the  *  and  y  directions 
respectively.  Due  to  the  particular  choice  of  the  power  distribution  in  each  mode  given  by  Table  II,  the 
intensity  is  low  at  the  beam  center  and  it  becomes  maximum  near  the  edges  of  the  beam  in  both  x  and  y 
directions  as  shown  in  Fig.  3.  There  are  4  major  peaks  of  power  density  at  comers  and  4  minor  peaks  in 
the  center.  The  paint  front  under  the  TEM3i  COIL  beam  irradiation  and  the  temperature  distribution  of 
substrate  surface  in  the  area  of  laser  beam  are  shown  in  Figs.  4  and  5. 


Table  II  Power  distribution  for  TEMji  COIL  beam  (W) 


n 

0 

1 

0 

7.5 

7.5 

m 

1 

30 

30 

2 

75 

75 

3 

112.5 

112.5 

At  the  front  edge  of  laser  spot  in  Fig.  4,  the  paint  begins  to  be  heated  and  nothing  is  removed.  When  the 
laser  beam  moves  in  x  direction,  the  paint  under  the  laser  irradiation  is  heated  up  to  melting,  even 
vaporizing  melting  and  the  paint  is  vaporized  continually.  There  are  two  valleys  because  there  are  two 
peaks  of  the  power  density  in  this  region.  At  the  rear  edge  of  laser  spot,  the  cross  section  of  the  paint  looks 
like  a  letter  "W".  Paint  cannot  be  removed  at  the  both  sides  of  laser  spot  because  of  the  low  power  density. 
The  temperature  distribution  of  substrate  surface  is  related  to  both  the  peaks  of  power  density  and  thickness 
of  paint.  The  highest  temperatures  are  located  around  rear  peak  of  power  density  as  shown  in  Fig.  5. 


FIG.  5.  Three-dimensional  distribution  of  normalized  surface  temperature 
(TEM3],  P=45  W,  v=10  cm/s,  211=2w,=10mm) 

The  final  cross-sectioned  shapes  of  paint  after  TEM31  COIL  beam  scanning  is  like  a  letter  "W"  when  the 
laser  power  is  not  strong  enough.  The  paint  will  be  vaporized  completely  around  the  center  of  laser  spot 
when  the  laser  power  increases  as  shown  in  Fig.  6.  It  can  be  seen  form  the  figure  that  the  paint  in  the 
central  area  is  removed  when  the  laser  power  is  high  enough  (about  500  W).  However,  the  removed  width 
of  paint  in  the  central  region  cannot  be  expect  to  increase  greatly  with  the  increasing  of  laser  power.  The 
effect  of  scanning  speed  of  laser  beam  on  the  paint  removal  is  shown  in  Fig.  7.  The  removed  width 


6 


Transverse  distance,  y  [mm] 


FIG.  6.  Variation  of  cross-sectioned  shape  of  paint  with  laser  power 
(TEM3I ,  v=10  cm/s,  21  =2W  =10  mm) 


Transverse  distance,  y  [mm] 

FIG.  7.  Variationg  of  cross-sectioned  shape  of  paints  with  scanning  speed 
(TEM  ,  P=450  W,  21 -2W  =10  mm) 
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increase  with  the  decrease  in  laser  scanning  speed.  However,  we  do  not  wish  to  reduce  the  speed  too  much 
because  the  high  efficiency  of  paint  removal  is  what  we  need. 

When  the  laser  power  and  other  process  parameters  are  kept  as  constant  and  only  the  width  of  laser  beam  is 
increasing,  the  removed  width  of  paint  increases  as  shown  in  Fig.  8.  This  means  that  the  wider  beam 
should  be  chosen  to  remove  the  more  paint  when  the  laser  power  is  high  enough.  If  the  width  of  laser 
beam  is  kept  unchanged  and  the  high  laser  power  is  applied,  the  rest  of  laser  power  other  than  that  used  to 
vaporize  the  paint  will  be  utilized  to  heat  up  the  substrate  and  may  cause  the  damage  of  substrate  such  as 
melting  and  degrading  of  mechanical  properties.  Similar  situation  occurs  when  the  laser  power  is  increased 
and  other  parameter  are  kept  constant  as  shown  in  Fig.  6.  The  increase  in  removed  width  of  paint  is 
limited  even  the  laser  power  increases  dramatically.  The  increased  amount  of  laser  energy  is  conducted  to 
substrate  for  heating  up  the  substrate.  Our  aim  is  to  remove  the  paint  completely  without  damaging  the 
substrate  and  the  further  heating  of  substrate  should  be  avoided. 


FIG.  8.  Cross-sectioned  shape  of  paint  at  diffemt  width  of  laser  beam 
(TEM3i,  P=2.2 5  kW,  v=10  cm/s,  21  =10  mm) 


The  removed  width  of  paint  and  the  highest  temperature  on  substrate  surface  for  different  laser  power  are 
summarized  in  Fig.  9.  This  figure  indicates  that  the  surface  temperature  increase  greatly  and  removed 
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width  of  paint  has  a  limited  increase  when  the  laser  power  is  high.  In  the  situation  shown  in  Fig.  9,  an 
optimum  laser  power  is  found  to  be  about  1  kW  as  the  removed  width  of  paint  increases  is  not  expected  to 
increase  a  lot  with  increasing  laser  power  but  the  higher  power  than  1  kW  will  causes  the  rapid  rise  in 
surface  temperature.  There  is  a  maximum  removed  width  of  paint  at  which  the  surface  temperature  reaches 
the  melting  point  of  substrate.  The  maximum  removed  width  of  paint  is  about  6.4  mm  in  Fig.  9.  When 
the  laser  power  is  less  than  a  minimum  value  which  is  about  500  Win  Fig.  9,  the  laser  power  is  not  strong 
enough  to  vaporize  the  paint  completely  from  substrate.  Similarly,  we  believe  that  there  is  a  optimum 
laser  power  when  other  parameters  such  as  scanning  speed,  beam  width  and  length  etc.  changes.  The 
relative-wider  area  paint  is  removed  and  the  temperature  of  substrate  surface  is  not  high  when  the  optimum 
laser  power  is  applied.  The  optimum  laser  power  can  be  identified  by  the  model  developed  in  this  study. 


FIG.  9.  Variation  of  removed  width  and  highest  normalized  temperature  of  substrate 
surface  with  laser  power  (TEM^,  v=10  cm/s,  21|=2W^=10  mm) 


We  have  discussed  the  paint  removal  by  TEM31  COIL  beam.  The  TEMoo  mode,  e.g.,  Gaussian  beam,  is 
often  used  in  laser  materials  processing.  The  three-dimensional  distribution  of  square  COIL  Gaussian  beam 
is  shown  in  Fig.  10,  which  is  a  little  different  from  circle  Gaussian  beam  in  edges.  The  paint  front 
irradiated  by  square  COIL  Gaussian  beam  is  shown  in  Fig.  1 1.  It  is  easy  to  understand  that  more  paint  is 
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FIG.  10.  Three-dimensional  distribution  of  power  density  for  TEMqq  COIL  beam 
(Power  400  W) 


removed  in  rear  edge  than  front  edge  of  laser  spot  and  the  paint  valley  is  located  at  the  center  of  rear  edge  of 
beam.  The  cross  section  of  paint  is  similar  to  the  Gaussian  distribution.  However,  the  highest  surface 
temperature  is  a  little  behind  the  center  of  the  square  beam  in  the  x  direction. 

It  is  interesting  to  compare  the  effect  of  TEMoo  and  TEM3,  modes  on  the  process  of  paint  removal.  First  we 
make  a  plot  of  cross-sectioned  shape  of  paint  changed  with  laser  power  for  TEMoo  COIL  beam  as  shown  in 
Fig.  12.  There  is  a  similar  trend  as  TEM3,  that  the  removed  width  increases  in  a  limited  value  when  the 
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Thickness  of  paint  [mm] 


FIG.  11.  Cross-section  of  paint  after  TEMqq  COIL  beam  radiation 
(P=400  W,  v=10  cm/s,  21i=2\V|=10  mm) 


-4  -2  0  2  4 

Transverse  distance,  y  [mm] 


FIG.  12.  Variation  of  cross-sectioned  shape  of  paint  with  laser  power 
(TEM  ,  v=10  cm/s,  2w|=2!  =10  mm). 
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laser  power  is  high.  Much  of  laser  energy  is  dissipated  to  heat  up  the  substrate.  Similarly,  we  believe 
that  there  is  an  optimum  laser  power  at  which  the  relative  more  paint  is  removed  but  the  substrate 
temperature  is  not  high.  The  optimum  laser  power  of  TEM00  is  found  to  be  about  1  kW  same  as  that  of 
TEM31.  This  indicates  that  the  optimum  laser  power  is  not  relative  with  laser  mode. 

The  removed  width  of  paint  and  temperature  of  substrate  surface  for  TEM31  and  TEM00  COIL  beams  are 
compared  in  Figs.  13  and  14.  The  removed  widths  of  paint  for  TEM31  and  TEMoo  beams  are  almost  same 
when  the  laser  power  is  below  the  optimum  power,  but  the  width  of  TEMoo  is  a  little  higher  than  that  of 
TEM31  when  the  higher  power  is  applied  (Fig.  13).  This  is  because  that  the  power  density  of  TEM00  in 
the  both  side  edges  is  a  little  higher  than  that  of  TEM31.  However,  the  temperature  of  substrate  surface 
caused  by  TEM31  beam  irradiation  is  always  higher  than  by  TEMoo.  This  situation  is  caused  by  the  higher 
values  at  the  peak  of  power  density  in  TEM3]  which  can  be  seen  clearly  by  comparing  their  distributions  cf 
power  density  in  Figs.  3  and  10.  To  avoid  the  high  temperature  of  substrate  surface  and  remove  more  paint, 


0  0.5  1  1.5  2  2.5  3  3.5 

Laser  power  [kW] 

FIG,  13.  Variation  of  removed  width  of  paint  with  laser  power 
(v=10  cm/s,  21  =2w(=10  mm). 


7-22 


FIG.  14.  Variation  of  normalized  temperature  of  substrate  surface  with  laser  power 
(v=10  cm/s,  21i=2wi=10  mm) 

the  uniform  distribution  of  power  density  is  needed.  That  is,  the  beam  quality  with  "flat  top"  or  uniform 
energy  density  distribution  is  very  important  for  large-area  surface  treatment  [5, 26]. 

In  this  study,  the  process  of  laser  paint  removal  is  presented  and  the  temperature  of  substrate  surface  is 
estimated.  However,  the  effect  of  laser  heat  on  mechanical  properties  of  substrate  should  be  investigated 
further.  Experiments  has  showed  that  the  degradation  in  ductility  of  aircraft  body  has  been  found  [10].  A 
detail  investigation  on  the  effect  of  laser  power,  scanning  speed  ,  and  laser  mode  on  mechanical  properties 
should  be  carried  out.  This  investigation  will  indicate  if  the  degradation  is  acceptable  by  practical  service 
of  aircraft.  The  further  improvement  to  this  model  by  both  experimental  and  theoretical  study  is  also 
important  to  predict  the  process  accurately. 
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Conclusions 


(1)  A  mathematical  model  for  laser  paint  removal  is  developed  in  this  study.  This  model  can  be  used  to 
estimate  the  processing  of  paint  removal  and  the  effect  of  laser  parameters  on  process.  The  temperature  cf 
substrate  surface  is  also  calculated  numerically  by  the  model. 

(2)  The  cross-sectioned  shape  of  paint  is  like  a  letter  of  "W"  after  TEM3i  COIL  beam  irradiation  and  it  is 
similar  to  Gaussian  distribution  after  TEMoo  beam  irradiation  when  the  laser  power  is  low.  The  paint 
around  the  center  of  beam  rear  edge  is  removed  completely  by  vaporizing  when  laser  power  is  high  enough. 

(3)  The  removed  width  of  paint  and  the  temperature  of  substrate  surface  increase  with  the  increase  in  laser 
power  and  decrease  in  beam  scanning  speed  and  beam  width.  There  is  optimum  laser  power  at  which  the 
relative  more  paint  is  removed  and  the  temperature  is  not  high.  The  optimum  value  is  affected  by  other 
process  parameters  and  is  not  relative  to  laser  mode. 

(4)  The  removed  width  of  paints  by  both  TEM3,  and  TEMoo  COIL  beam  are  close  to  each  other,  but  the 
TEM3)  beam  causes  higher  temperature  rise  on  substrate  surface. 

(5)  The  uniform  distribution  of  power  density  is  good  for  laige-area  surface  treatment.  The  high  value  of 
peak  power  density  may  damage  the  substrate  or  cause  the  further  degradation  in  properties  of  substrate. 
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Appendix  Thermophysical  Properties  of  Polymers 


(1)  Thermal  conductivity  (W/mK) 


Polymer 

Thermal  Conductivity  (W/mK) 

Reference 

Epoxy  resin 

0.2 

[A-7],[A-1 1] 

Polyethylene 

0.2 

[A-2] 

LDPE 

0.33 

[A-2] 

HDPE 

0.65 

[A-2] 

PTEF 

0.3 

[A-l  1] 

Polyvinyl  carbazole 

0.16 

[A- 12]  (p.  972) 

TAC  polyester 

0.18 

[A-12]  (p.  976) 

Natural  and  synthetic  rubber 

0.21-0.32 

[A-12]  (p.  972) 

Polyvinyl  choloride 

0.17  at  300-500  K 

[A-12]  (p.  1086) 

Polystyrene 

0.16 

[A-12]  (p.  1090) 

The  thermal  conductivity  of  polymers  is  in  the  range  of  0.15-0.45  W/mK  [A-6].  It  is  in  about  0.15-0.3 
W/mK  for  most  polymers  as  shown  in  Fig.  A-l  [A-6]. 
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FIG.  A-l.  Thermal  conductivity  versus  temperatures 
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(2)ThermaI  diffiisivity 


Polymers 

Thermal  Diffusivity  (lO-6  m2/s) 

Reference 

EP 

0.06 

[A-ll] 

HDPE 

0.9 

[A- 11] 

Natural  rubber 

0.04-0.15  at  300-500  K 

[A- 12]  (p.  1058) 

Synthetic  rubber 

0.06-0.16 

[A- 12]  (p.  1060) 

Polyvinyl  chloride 

0.09 

[A- 12]  (p.  1082) 

Polyvinyl  cabazole 

0.1 

[A- 12]  (p.  1082) 

Phenolic  resin 

0.125 

[A- 12]  (p.  1082) 

TAC  polyester 

0.115 

[A- 12]  (p.  1082) 

(3)  Melting  temperature 

Polymer 

Melting  Temperature  (°C) 

Reference 

PVDC 

198-205 

[A-3] 

Poly(vinyl  chloride) 

200-220 

[A-5] 

Polytetrafluoroethylene 

310-340 

[A-4] 

Polyvinlidene  fluoride 

210 

[A-9] 

Polypropylene 

176 

[A-9] 

Nylon-6 

225 

[A-9] 

PVC 

180 

[A-9] 

Polystyrene 

230 

[A-9] 

Polymethylene 

108-136 

[A- 10] 

(4)  Thermal  decomposition  temperature 

Polymer 

Thermal  Decomposition  Temperature  (°C) 

Reference 

Poly(methl  methacrylate)  in  H2 

310 

[A-4] 

in  Air 

242 

[A-4] 

Polystyrene  in  H2 

337 

[A-4] 

Polystyrene  filled  with  Si02  in  H2 

337 

[A-4] 

Phenol-formaldehyde  resin  in  H2 

432 

[A-4] 

Benzhydrylated  cellulose  in  Air 

280-310 

[A-4] 

(5)  Specific  heat 
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Polymer 

Specific  Heat  (J/gK) 

Reference 

polyethylene 

2.3  at  room  temperature 

[A- 10] 

2.9  at  12-140  °C 

[A- 10] 

Polyvinyl  carbazole 

1.24 

[A- 12]  (p.  970) 

Cresol  resin 

1.6 

[A- 12]  (p.  1004) 

Nylon 

1 .3  at  room  temperature 

[A- 12]  (p.  1047) 

2.09  at  100  °C 

[A- 12]  (p.  1047) 

2.50  at  1 80  °C 

[A- 12]  (p.  1047) 

Synthetic  rubber 

1.5 

[A- 12]  (p.  1054) 

Silicon  resin 

1.36  at  300-650  K 

[A- 12]  (p.  1072) 

Phenyl  silane  resin 

1. 4-2.1  at  300-650  K 

[A- 12]  (p.  1074) 

PVC  resin 

1.25 

[A-12] 

Poly(vinyl  chloride) 

Q=0.24S+S.46x/0-'r  (Cal/°Cg3) 

at  20-120  °C 

[A-5] 

(6)  Latent  heat  of  fusion 

Polymer 

Latent  Heat  of  Fusion  (J/g) 

Reference 

Vinylidene  Chloride  (VDC) 

65-82 

[A-3] 

PVDC 

43-47 

[A-3] 

PE 

176-213 

[A-4] 

(7)  Latent  heat  of  evaporation 

Latent  heat  of  evaporation  (J/g) 

Reference 

Vinylidene  Chloride  Monomer 

272 

[A-3] 

(8)  Heat  of  Combustion 

Polymer 

Heat  of  Combustion  (kJ/g) 

Reference 

Rubber 

40 

[A-10] 

Polyisobutylene 

47 

[A- 10] 

Polypropylene 

46 

[A-10] 

Polystyrene 

41 

[A-10] 

ABS 

36 

[A-10] 

Polyformaldehyde 

17 

[A-10] 

Poly(vinyl  chloride) 

18 

[A-10] 
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Poly(methyl  methacrylate)  25  [A- 10] 

(9)  Density 

Paint  or  Polymer 
An  Aliphatic  Polyurethane  Gloss 
Titanium  Dioxide  Pigmented  Paint 
(Federal  Stardand  Color  17875) 

Polyethylene 
Phenolic  resin 
Natural  and  synthetic  rubber 
TAC  Polyester 

(10)  Selected  thermophysical  properties  for  paints 


Thermal  conductivity 

0.2 

W/mK 

Thermal  diffusivity 

0.1x1  O'6 

m2/s 

Melting  temperature 

200 

°C 

Thermal  decomposition  temperature 

320 

°C 

Specific  heat 

2.0 

J/gK 

Latent  heat  of  fusion 

65 

J/g 

Latent  heat  of  vaporization 

272 

J/g 

Density 

1.2 

g/cm3 

[A-l]  D.  Stoye,  Paints,  Coatings  and  Solvents,  VCH  mbH,  Weinheim,  1993,  pp.3-7. 
[A-2]  D.  Porter,  Group  Interaction  Modelling  of  Polymer  Properties,  Dow  Benelux, 

N.V.,  The  Netherlands,  1995,  pp.  241. 

[A-3]  R.  A.  Wessling,  Polyvinylidene  Chloride,  Gordon&  Breach  Science,  New  York, 
1977,  pp.  7,  70,  81,  106. 

[A -4]  R.  F.  Schwenker,  Jr.,  Thermoanalysis  of  Fibers  and  Fiber-Forming  Polymers,  John 
Wiley  &  Sons,  New  York,  pp.81,  90,  128. 

[A-5]  J.  V.  Koleske  and  L.  H.  Wartman,  Poly(vinyl  chloride),  Gordon&  Breach  Science, 
New  York,  1969,  pp.  62,  65. 

[A-6]  P.  C.  Powell,  Engineering  with  Polymer,  Chapman  and  Hall,  London,  1983,  pp. 
240-243. 

[A-7]  H.  Lee  and  K.  Neville,  Handbook  of  Epoxy  Resins,  McGraw-Hill,  New  York, 

1967,  p.6-32. 


Density  Reference 

14.709  lbs/gal  [A-8] 

0.92-0.96  g/cm3  [A- 10] 

1.30  g/cm3  [A- 12]  (p.  980) 

1.2- 1.7  g/cm3  [A-12]  (p.  1051) 

1.23  g/cm3  [A-12]  (p.  974) 
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[A-8]  U.S.  Paint  Company,  St.  Louis,  MO,  personal  communication. 

[A-9]  Handbook  of  Polymer  Science  and  Technology,  Vol.  1:  Synthesis  and  Properties, 
Ed.N.  P.  Cheremisionoff,  Marcel  Dekker,  New  York  and  Basel,  1989,  p.  495. 

[A- 10]  Handbook  of  Polymer  Science  and  Technology,  Vol.  2:  Performance  Properties  of 
Plastics  and  Elastomers,  Ed.  N.  P.  Cheremisionoff,  Marcel  Dekker,  New  York  and 
Basel,  1989,  pp.  207,  352,  355,  363. 

[A- 11]  G.  Hartwig,  Polymer  Properties  at  Room  and  Cryogenic  Temperatures,  Plenum, 
New  York  and  London,  1994,  pp.  110,  112,  114. 

[A-12]  Y.  S.  Touloukian  and  C.  Y.  Ho,  Thermophysical  Properties  of  High  Temperature 
Solid  Materials,  Vol.  6,  Part  II:  Oxidment,  Polymer  and  Composite,  Macmillan, 
New  York,  1967. 
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ANALYTICAL  NOISE  MODELING  AND  OPTIMIZATION  OF  A  PHASOR-BASED 
PHASE  RECONSTRUCTION  ALGORITHM 

Thomas  F.  Krile 
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Department  of  Electrical  Engineering 
Texas  Tech  University 


Abstract 

An  important  procedure  in  the  areas  of  adaptive  optics  and  speckle  imaging  is  to 
reconstruct  an  estimate  of  an  incident  wavefront's  phase  field,  given  measurements  of  the 
incident  field's  phase  differences.  Optimal  linear  procedures  for  this  phase  reconstruction 
are  well  known,  and  a  newer  phasor-based  technique  has  recently  been  introduced.  In  a 
typical  phase  reconstruction  algorithm,  forming  a  weighted  sum  of  phases  and/or  phase 
differences  is  an  often-repeated  operation,  and  the  phasor-based  approach  minimizes  the 
problem  of  2n  phase  ambiguities  at  points  of  zero  field  amplitude  by  averaging  phasor 
paths  instead  of  phase/phase  difference  paths.  While  there  are  general  analytical  models 
available  for  studying  the  performance  of  linear  reconstructors  in  the  presence  of  phase 
measurement  noise,  no  such  models  exist  for  the  phasor-based  case.  Until  now,  noise 
studies  of  the  new  phasor-based  reconstruction  technique  were  done  using  computer 
simulations  with  randomly  generated  noise  data. 

In  this  paper,  an  analytical  technique  for  finding  the  probability  density  function  (pdf) 
of  the  output  noise,  and  thence  the  noise  variance  and  noise  gain,  for  phasor-based  path 
averaging  systems  is  developed.  Variable  path  weights  have  been  incorporated  as  inputs 
to  the  output  noise  pdf  algorithm  which  characterizes  the  phasor-based  reconstructor. 
These  path  weights  have  been  optimized  to  minimize  the  output  noise  variance,  and 
robustness  and  optimality  results  are  compared  to  those  obtained  using  path  weights 
derived  from  optimal  linear  techniques. 
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ANALYTICAL  NOISE  MODELING  AND  OPTIMIZATION  OF  A  PHASOR-BASED 


PHASE  RECONSTRUCTION  ALGORITHM 
Thomas  F.  Krile 
James  L.  Fox 
I.  INTRODUCTION 

An  important  procedure  in  the  areas  of  adaptive  optics  and  speckle  imaging  is  to 
reconstruct  an  estimate  of  an  incident  wavefront’s  phase  field,  given  measurements  of  the 
incident  field’s  phase  differences.  In  adaptive  optics  systems,  the  phase  field  information 
is  then  used  to  compensate  for  atmospheric  disturbances  [1].  In  the  case  of  speckle 
imaging,  the  phase  field  information  is  used,  along  with  intensity  measurements,  to 
reconstruct  the  complex  far  field  pattern  of  the  original  object,  which  is  then  Fourier 
transformed  to  obtain  an  estimate  of  the  object[2,3]. 

Two  methods  of  phase  reconstruction  that  are  currently  used  are  linear  [1,4]  and 
nonlinear  (phasor-based)  [2]  reconstruction.  This  paper  concentrates  on  the  effect  that 
phase  measurement  noise  has  on  these  two  methods  of  phase  reconstruction.  Until 
recently,  noise  studies  of  a  new  non-linear  phasor-based  reconstruction  technique  were 
done  using  computer  simulations  with  randomly  generated  noise  data.  These  studies 
indicated  that,  as  far  as  noise  performance  is  concerned,  the  non-linearity  introduced  into 
the  reconstruction  process  by  the  phasor-based  approach  leads  to  markedly  different 
(inferior)  noise  behavior  than  that  for  the  non-phasor  (linear)  case  for  a  range  of  input 
noise  variances  [5,6,]. 

This  paper  will  present  an  analytical  method  for  finding  the  probability  density 
function  (PDF)  and  variance  of  the  output  noise  of  the  phasor-based  reconstructor [7]. 
Variable  path  weights  have  been  added  as  inputs  to  the  output  noise  PDF  program  which 
characterizes  the  non-linear  reconstructor.  These  path  weights  have  been  optimized  to 
minimize  the  output  noise  variance  and  results  are  compared  to  those  obtained  using  path 
weights  derived  from  optimal  linear  techniques.  The  robustness  of  the  output  noise 
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variance  with  respect  to  the  input  noise  variance  estimates  will  also  be  examined.  Section 
II  of  this  paper  discusses  the  development  of  the  linear  and  phasor-based  noise  models. 
From  these  models,  the  probability  density  function  of  On,  the  reconstructor  output  noise, 
will  be  developed  in  Section  III.  Next,  Section  IV  will  discuss  the  method  for  finding  the 
optimal  phasor-based  weights  for  the  reconstructor.  Section  V  will  present  a  comparison 
of  the  noise  performances  of  each  of  the  two  reconstructors  (phasor  and  non-phasor). 
Noise  gain,  optimality  of  the  phasor  weights  and  robustness  to  input  noise  estimations 
will  also  be  mentioned.  Conclusions  are  given  in  Section  VI. 

II.  NOISE  MODEL  DEVELOPMENT 

2.1.  LMS  Phase  Reconstruction  Model 

The  least  mean  square  (LMS)  phase  reconstruction  method  uses  a  weighted  sum 
of  phases  and/or  phase  differences  to  reconstruct  the  wavefront  of  an  incident  waveform 
from  its  reflected  speckle  pattern.  A  problem  with  this  method  of  wavefront 
reconstruction  is  the  occurrence  of  271  phase  ambiguities  at  points  of  zero  phase  field 
amplitude  in  the  measured  speckle  pattern.  For  this  reason,  the  technique  of  averaging 
phasor  paths  instead  of  phase/phase  difference  paths  has  been  studied  [5],  Unfortunately, 
this  is  a  non-linear  process,  as  far  as  noise  is  concerned,  and  produces  a  significantly 
different  noise  behavior  than  that  for  the  non-phasor  case.  A  common  operation  for 
finding  the  phase,  Oy,  from  its  surrounding  phase  and  phase  differences  is  shown  in 
Figure  2.1.  This  figure  represents  a  small  portion  of  the  reflected  speckle  pattern.  In  the 
figure,  Sxk  ]=  Ok  |+1  -  Ok  l  and  SYk  l  =  Ok+1!  -  Ok The  subscript  terms  represent  the  y- 
and  x-direction  phase  measurement  matrix  coordinates,  the  S’s  represent  the  measured 
phase  differences  and  the  O  terms  represent  the  phase  in  the  speckle  pattern. 
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¥ 


Figure  2.1.  Paths  for  the  phase  and  phase 

difference  measurements  are  shown  here  [5]. 

The  common  linear  LMS  estimator  for  ®y  is  shown  in  equation  (1). 

Oij  =  w1-(OiJ.1+S^_1)+w2-(Oi.1J+S^_1.)  + 

w3*(Oi+y-  Sy)+w4-(Oij+1-  Sy).  (1) 

In  equation  (1),  if  all  noise  variances  of  the  phase  and  phase  differences  are 

assumed  to  be  equal,  then  all  of  the  path  weight  terms,  wn,  will  also  be  equal  with  a  value 

of  0.25.  This  will  accomplish  the  goal  of  minimizing  the  variance  of  ®y.  It  was 

assumed  that  each  of  the  noise  terms  are  additive,  uncorrelated,  and  represent  normally 

distributed  random  variables  with  a  variance  of  Gn2.  Considering  these  assumptions,  the 

mean  square  error  of  ®y  can  be  found  to  be  E2  =  a2  =  (1/2)  an2. 

2.2.  Phasor-based  Phase  Reconstruction  Model 
In  phasor  form,  equation  (1)  can  be  rewritten  as  [5]: 
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(2) 


OiJ=Arg(w1-exp[j(OiJ.1  +Sy.1)]+w2-exp[j(Oi.1J  +S?^)]+ 
w3-expO(Oi+1J  -S^)]+w4-exp[j(Oij+1  -S* .)]). 

1sJ 

Similarly,  the  noise  portion  of  Oy  can  be  written  as: 

f  \  (  \ 


where  the  n’s  are  normally  distributed,  independent,  zero-mean  random  variables  with  a 
variance  of  CTn  .  As  shown  in  equation  (4),  equation  (3)  can  be  rewritten  in  trigonometric 
form  as: 


On  =  Tan'1  (X/Y),  -7t<On<7t, 


(4) 


where  X  and  Y  are  defined  as  follows: 

(  -  A 


X=W]-sin 


tn. 

U=i  ) 


+  w2-sin 


in, 

si=i 


ink 


+  w3-sin|  2^nk  |  +  w4-sin 

c=l 


2-nj 

Vi-1  J 


(5) 

(6) 


III.  PDF  OF  On 
3.1.  Introduction 

In  order  to  help  characterize  the  output  noise  of  the  reconstructor,  the  probability 
density  function  (PDF)  of  On  from  equation  (4)  must  be  found.  Due  to  the  complex 
nature  of  ®n,  it  would  be  quite  difficult  to  directly  find  its  PDF.  Therefore,  the  On 
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expression  was  broken  down  into  smaller,  more  manageable  parts.  The  PDF’s  of  each  of 
these  smaller  expressions  were  found  and  then  used  to  indirectly  calculate  the  PDF  of  On. 

3.2.  PDF  of  Sine  and  Cosine  Terms 

The  first  of  the  simpler  PDF’s  to  be  found  is  that  of  each  of  the  sine  terms  in 
equation  (5).  The  argument  of  each  of  the  sine  terms  is  denoted  by  T,  where  T  is  the 
summation  of  two  independent,  normally  distributed,  N(0,CTn2),  random  variables.  Due  to 
the  nature  of  these  noise  terms,  and  assuming  that  the  noise  variances  within  a  path  are 
equal,  T  is  found  to  be  a  normally  distributed,  N(0,2cin2),  random  variable.  From  this 
development,  the  PDF  of  U=w-sin(T)  is  shown  in  equation  (7)  and  derived  in  Appendix 
A. 

f;  (u)=[  1  Aj (4jia„2(w2-u2))]  E[exp(-T2/(4c„2))].  (7) 

Here  the  summation  includes  all  T’s  such  that  T=arcsin(u/w)  and  u  ranges  from  -w  to  w. 
The  PDF  in  equation  (7)  approaches  a  zero-mean  normal  distribution  with  a  variance  of 
2an2  for  small  values  of  an2  and,  as  an2  increases,  the  PDF  approaches  1/(tc  Ay/( wW))). 
The  latter  form  is  that  of  the  PDF  of  a  general  sinusoid  with  uniformly  distributed  phase 
angle.  Figure  3.1  shows  how  this  PDF  changes  as  the  noise  variance,  an2,  increases. 

Through  a  similar  development,  the  probability  density  function  of  each  cosine 
term,  V=w-cos(T),  in  equation  (6)  is  found  to  be: 

f;(v)=[W(4jtan2(wW))]  E[exp(-T2/(4o„2))].  (8) 
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This  time  the  summation  is  taken  to  include  all  T’s  such  that  T=arccos(v/w),  where  v 
ranges  from  -w  to  w.  The  PDF,  fv(v),  approaches  a  delta  function,  8(v-w),  as  cn2 
approaches  zero.  For  larger  values  of  an2,  the  PDF  approaches  1/(tt  /■\/(w2-v2)).  Figure 
3.2  shows  how  fv(v)  changes  as  the  noise  variance,  an2,  increases. 


pdf  of  0.25*sin(T),  variance=0.1  pdf  of0.25*sin(T),  variance=0.3 


Figure  3.1.  The  PDF  of  U=w-sin(T)  versus  an2. 
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pdf  of0.25*cos(T),  variance=0.1 


pdf  of  0.25*cos(T),  variance=0.3 


Figure  3.2.  The  PDF  of  V=w-cos(T)  versus  crn2. 

3.3.  PDF  of  Sum  of  Sine  and  Cosine  Terms 
Remembering  that  X  and  Y  are  the  summation  of  four  sine  and  cosine  terms, 
respectively,  the  PDF’s  of  X  and  Y  can  now  be  found.  Since  each  of  the  four  sine  terms 
in  X  are  independent,  the  PDF  of  X  is  simply  the  four-fold  convolution  of  the  PDF’s  of 
each  of  the  sine  terms.  Likewise,  the  PDF  of  Y  is  the  four-fold  convolution  of  the  PDF’s 
of  each  of  the  cosine  terms.  All  of  the  noise  terms  in  the  arguments  of  the  sine  and  cosine 
terms  are  assumed  to  be  equal.  Figure  3.3  shows  the  PDF’s  of  X  for  various  noise 
variances.  The  PDF  of  X  varies  from  Gaussian  for  small  an2  to  a  triangularly-shaped 
PDF  as  an2  increases. 
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Figure  3.4  shows  the  PDF’s  of  Y  for  various  noise  variances.  As  can  be  seen 
from  the  figure,  the  PDF  of  the  sum  of  four  0.25-cos(T)  terms  is  approximately  a  delta 
function,  8(y-l),  for  small  input  variances,  and  evolves  into  a  somewhat  triangularly- 
shaped  PDF,  as  seen  also  in  Figure  3.3,  as  the  input  noise  variance  increases.  Thus,  for 
large  an2,  the  X  and  Y  PDF’s  look  the  same. 


pdf  of  sum  offour0.25*sinfr)'s,  variance=0.1  pdf  of  sum  offour0.25*sin(T)'s,  variance=0.3 


Figure  3.3.  Probability  density  function  of  X  as  the  noise 
variance  increases. 
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pdf  of  sum  offour0.25*cos(T)'s1  variance^.  1  pdf  of  sum  offourO^cosfTys,  variance=0.3 


Figure  3.4.  Probability  density  function  of  Y  as  the  noise 
variance  increases. 


3.4.  PDF  of  Z 


Now,  since  the  PDF’s  of  X  and  Y  have  been  found,  the  next  step  is  to  find  the 
PDF  of  Z=X/Y.  To  do  this,  an  auxiliary  random  variable  will  be  defined  as  M=Y.  The 
development  of  the  PDF  of  Z  is  demonstrated  here. 


Z=X/Y=G1(x,y). 


Define  an  auxiliary  variable 


M=Y=G2(x,y), 


X=ZY=ZM. 
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Then 


fzm(z,m)  = 


fxv(xfy) 

1 3T’ 


where  J'1  denotes  the  inverse  Jacobian  as  shown  below. 


dGj/dx  dG,/dy 

1 IY  -X/Y2 

dG2/dx  dG2/dy 

0  1 

1 


|M| 


Therefore,  fzm(z,m)=|m|-fxy(m-z,m) 

where,  fxy(x,y)=fx(x)-fy(y), 

when  X  and  Y  are  independent. 

Then,  to  find  fz(z),  the  variable  m  is  integrated  out,  as  follows. 


f2(z)= 


f  |m|-fxy(m-z,m)  dm 


5 


V-00 


J 


or 


r 


fz(z)= 


oo 


A 


f  |m|-fx(m-z)-fy(m)  dm 

V-oo  J 


(9) 


Equation  (9)  represents  the  probability  density  function  of  Z=X/Y.  Figure  3.5 
shows  how  the  PDF  of  Z  varies  as  the  input  noise  variance,  ct„2,  increases.  For  small 
an2’s,  fz(z)  has  a  Gaussian  distribution.  As  an2  increases,  the  PDF  approaches  a  Cauchy 
distribution. 
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Figure  3.5.  This  is  the  PDF  of  Z  =  X/Y  for  various  values 
ofan2. 


3.5.  PDF  of  ArcTanfZ^ 

Now,  the  PDF  of  On=ArcTan(Z),  from  equation  (4),  must  be  found.  Since  the 
inverse  tangent  function  retons  only  values  in  the  range  of  -tc/2  to  nil,  the  four-quadrant 
inverse  tangent  function  must  be  used  in  order  to  return  values  in  the  range  of  -n  to  n,  as 
desired.  This  is  done  by  dividing  fx(x)  and  fy(y)  into  their  positive  and  negative  parts,  and 
then  using  these  new  PDF’s  to  recalculate  f2(z).  The  new  PDF’s  for  X  and  Y  are  shown 
in  equations  (10)  and  (11),  respectively. 

fx(x)=fx.(x)  +  fx+(x),  (10) 

fy(y)=fy-(x)  +  fy+(y).  (11) 
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Using  equations  (10)  and  (11),  the  PDF  of  Z  shown  in  equation  (9)  can  be 


rewritten  as  shown  in  equation  (12). 


f 


m 


00 


A 


/  M  •  ([fx+(m-z)+fx.(m-z)]  •  [fy+(m)+fy_(m)])  dm 

V-CO  J 


=  Wz)  +  fz+.(z)  +  Uz)  +  Uz). 


(12) 


Now,  a  new  random  variable  is  defined  to  be  P=®n=Tari'(Z),  and  its  PDF  can  be 
found  to  be: 


fP(p)  =  fz(tan(p))  •  |l+tan2(p)|, 

f  (fz-(tan(p)))  •  |l+tan2(p)| ,  -n<p<-n/2 
or  fP(p)  =  i  (fz++(tan(p))  +  fz+.(tan(p)))  •  |l+tan2(p)|,  -7i/2<p<7t/2 
l  (fz.+(tan(p)))  •  |l+tan2(p)|,  7u/2<p<7t. 

(13) 


The  PDF  of  P  varies  from  a  normal  distribution,  N[0,<rn2/2]  for  small  an2  to 
uniform  U[-7i,7i]  as  an2  increases,  as  shown  in  Figure  3.6. 

Now,  a  method  for  finding  an  expression  for  the  PDF  of  the  output  noise  of  a 
path-averaging,  phasor-based  phase  reconstructor  as  a  function  of  the  input  noise  variance 
and  path  weights  has  been  found. 
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Figure  3.6.  Probability  density  function  of  P=On  for 
several  cn2. 


the  next  step  is  to  find  the  variance  of  ®n.  Since  <Dn  is  a  zero-mean  random 
variable,  the  variance,  a^2,  can  be  found  from  the  PDF  of  P  by  using  equation  (14): 

f 


2  _ 


><D 


00 


f  P2  •  fp(p)  dp 


V-00 


(14) 


J 


This  is  the  PDF  of  the  output  noise  of  the  phasor  reconstructor. 

Until  now,  the  model  has  used  four  paths  in  order  to  reconstruct  the  phase 
information,  as  shown  in  Figure  2.1.  It  is  now  convenient  to  consider  the  case  where  only 
two  paths  are  used  in  the  reconstructor.  For  the  2-path  case,  the  two  paths  in  the  x- 
direction  are  assumed  to  have  the  same  noise  variance  and  the  two  paths  in  the  y- 
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direction  are  assumed  to  have  the  same  noise  variance,  but  different  from  the  x-direction. 


Therefore,  both  W;’s  in  the  x-direction  would  have  the  same  value  and  both  w/s  in  the  y- 
direction  would  have  the  same  value.  The  conversion  process  from  a  4-path  problem  to  a 
2-path  problem,  for  the  linear  reconstructor  case  is  demonstrated  in  the  beginning  of 
Appendix  B.  Below  are  the  2-path  equations  for  the  phasor  case  corresponding  to 
equations  (5)  and  (6). 


Using  optimal  linear  weights  for  the  x-  and  y-directions,  wx  and  wy,  and  equation 
(14),  the  output  variance  of  the  phasor-based  reconstructor  has  been  calculated  for  several 
different  input  noise  variances.  These  output  variances  are  shown  in  Figure  3.7  and 
tabulated  in  Appendix  C.  For  this  figure,  the  input  noise  variances  in  the  x-  and  y- 
direction  were  varied  independently  from  0.25  up  to  4.0.  The  optimal  linear  weights  for 
the  reconstructor  as  a  function  of  the  input  noise  variances  are  found  according  to 
equations  (17)  and  (18),  and  are  derived  in  Appendix  B. 


(17) 

(18) 
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As  can  be  seen  in  Figure  3.7,  the  output  noise  variance  of  the  phasor-based 
reconstructor  using  optimal  linear  weights  has  a  maximum  of  approximately  3.3.  This 
maximum  occurs  whenever  cx2  and  ay2  are  both  greater  than  approximately  2. 


Figure  3.7.  Output  noise  variance  as  a  function  of 
the  input  noise  variances  in  the  x-  and  y- 
directions  using  optimal  linear  weights  in  the 
phasor  reconstructor. 
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IV.  FINDING  OPTIMAL  PHASOR-BASED  WEIGHTS 


Using  the  PDF  of  <f>n  developed  in  the  last  chapter,  the  optimal  phasor-based 
weights  for  the  reconstructor  can  be  found  for  a  given  set  of  input  noise  variances.  This 
was  done  by  varying  the  optimal  linear-based  weights  (used  for  Figure  3.7)  slightly  until 
the  resultant  output  noise  variance  was  minimized. 

Now  suppose  that  the  optimal  phasor-based  weights  are  to  be  found  for  input 
noise  variances  of  ax2=0.5  and  ay2=0.25.  The  first  step  is  to  find  the  optimal  linear- 
based  weights  corresponding  to  these  input  noise  variances.  From  equations  (17)  and 
(18),  the  linear-based  weights  can  be  found  to  be:  wx  =  1/6  =  0.166  and  wy  =  1/3  =  0.333. 
To  find  the  optimal  phasor-based  weights,  these  values  for  wx  and  wy  are  varied  locally 
about  their  original  values  (i.e.,  wx  is  varied  from  0.1  to  0.2,  which  forces  wy  to  vary  from 
0.4  down  to  0.3  since  the  two  weights  must  sum  to  0.5).  Within  the  new  range  of  the 
weights,  the  values  of  wx  and  wy  that  produce  the  minimum  phasor  reconstructor  output 
noise  variance  are  the  optimal  phasor-based  weights  if  the  range  of  variation  is  large 
enough  to  give  a  minimum.  Figure  4.1  shows  the  variance  of  the  output  noise  when 
optimal  phasor-based  weights  are  used  in  the  phasor  reconstructor.  As  can  be  seen,  the 
output  variance  of  the  reconstructor  is  nearly  always  smaller  in  the  phasor  weight  case 
than  that  for  the  non-phasor  weight  case  (Figure  3.7)  for  a  given  input  noise  variance. 
There  are,  however,  unexplained  ‘ripples’  in  the  phasor  weight  output  variance  plot 
shown  in  Figure  4.1  compared  to  the  ‘smooth’  optimal  linear  weight  case  plot  of  Figure 
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3.7.  It  should  be  noted  that  these  ‘ripples’  are  also  seen  in  Monte-Carlo  simulations  of 
the  output  variance,  so  they  are  probably  a  result  of  system  nonlinearities  rather  than 
artifacts  of  computational  inaccuracies. 

The  data  that  was  used  to  generate  the  plot  in  Figure  4.1  is  shown  in  Appendix  C. 
Also,  Appendix  D  shows  optimal  phasor-based  x-direction  weights  and  the  optimal 
linear-based  x-direction  weights.  The  optimal  y-direction  weights  can  be  found  as 
described  in  Appendix  D.  As  the  data  shows,  the  optimal  phasor-based  x-direction 
weights  are  nearly  always  smaller  than  the  optimal  linear-based  x-direction  weights  for  a 
given  set  of  input  noise  variances  in  the  x-  and  y-directions. 

Figure  4.2  shows  the  difference  of  the  output  variance  of  the  phasor  reconstructor 
with  optimal  linear  weights  and  the  output  variance  of  the  same  reconstructor  with 
optimal  phasor  weights.  The  data  from  this  figure  is  shown  in  Appendix  C.  Figure  4.2 
shows  that  the  output  noise  variance  of  the  phasor  reconstructor  with  optimal  linear 
weights  is  nearly  always  greater  than  the  output  noise  variance  of  the  same  reconstructor 
using  optimal  phasor  weights.  It  should  be  noted  that  there  are  a  few  cases,  namely  for 
small  input  noise  variance,  for  which  this  is  not  the  case. 
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output  noise  variance 


input  noise  variance  in  y-dir 


0  0 


input  noise  variance  in  x-dir 


Figure  4. 1 .  Output  noise  variance  using  optimal  phasor- 
based  weights  in  the  phasor  reconstructor. 
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linear  weight  output  variance  minus  phasor  weight  output  variance 


(0 


Figure  4.2.  This  figure  shows  the  difference  of  Figures 
3.7  and  4.1 .  This  is  the  output  noise  variance  of  the  phasor 
reconstructor  with  optimal  linear  weights  minus  the 
output  noise  variance  of  the  phasor  reconstructor  with 

optimal  phasor  weights. 


8-21 


V.  NOISE  PERFORMANCE:  PHASOR  VERSUS  NON-PHASOR 


There  are  several  ways  that  the  noise  performance  of  the  reconstructors  can  be 
analyzed.  The  three  methods  of  comparison  that  will  be  discussed  are  noise  gain, 
optimality  of  weights  and  robustness  to  errors  in  the  input  noise  variance  estimates. 

5.1  Noise  Gain 

The  noise  gain  is  defined  as  the  ratio  of  the  output  noise  variance  and  the  input 
noise  variance  of  the  reconstructor.  Figure  5.1  compares  the  noise  gains  of  the  phasor- 
based  and  non-phasor-based  reconstructors  where  the  input  noise  variances  in  the  x-  and 
y-directions  are  equal  and  optimal  linear  weights  are  used  in  both.  From  Figure  5.1,  it 
can  be  seen  that  the  two  reconstructors  have  nearly  identical  noise  gains  when  the  input 
noise  variance,  an2,  is  less  than  approximately  0.3.  The  noise  gain  is  defined  as  E2/ct„2  = 
1/2  for  the  non-phasor  case  and  a02/an2  for  the  phasor  case  [7],  As  ct„2  increases,  the 
noise  gain  of  the  phasor-based  reconstructor  grows  significantly  larger  than  the  non- 
phasor  reconstructor.  The  phasor-based  noise  gain  will  again  equal  the  non-phasor  noise 
gain  when  an2  is  approximately  6.58.  Therefore,  for  the  range  of  an2  that  we  are 
interested  in,  the  phasor-based  reconstructor  noise  gain  is  larger  than  the  gain  of  the 
linear-based  reconstructor. 
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Figure  5.1.  This  is  a  comparison  of  the  noise  gains  of  the 
two  different  reconstructors.  The  phasor-based 
reconstructor  has  a  larger  noise  gain  than  the  non-phasor 
based  reconstructor  over  most  of  this  range  of  input  noise 
variances. 

5.2  Robustness 

It  is  shown  in  Figure  5.2  that  the  phasor-based  reconstruction  approach  is,  in 
general,  more  robust  to  errors  in  the  input  noise  variance  estimations  than  is  the  linear- 
based  approach.  This  can  be  seen  by  noting  that  the  phasor-case  curve  is  wider  and  more 
‘shallow’  than  the  non-phasor  curve.  This  means  that  a  small  variation  in  the  input  noise 
variance  will  produce  only  a  small  variation  in  the  phasor-based  reconstructor  output 
noise  variance. 


8-23 


Phasor  Case 


Non-Phasor  Case 


wx 

Figure  5.2  Plot  shows  that  robustness  of  the  phasor-based 
reconstructor  compared  to  the  linear-based  reconstructor. 

The  phasor-based  technique  is  much  more  robust  than  the 
linear-based,  as  can  be  seen  from  the  wider  curve.  For  this 
plot,  crx2=2  and  ay2=2. 

5.3  Optimality 

Figure  5.3  is  an  example  that  shows  that,  for  the  phasor  reconstructor,  the  linear 
weights  are  not  always  optimal.  The  optimal  linear  x-direction  weight  for  the  linear 
reconstructor  is  0.2  for  the  case  shown  in  Figure  5.3.  But  from  the  figure,  it  can  be  seen 
that  the  optimal  phasor-based  x-direction  weight  for  the  phasor  reconstructor  is  0.18. 
Therefore,  the  optimal  phasor  weights  are  the  weights  of  choice  if  the  output  variance 
from  the  phasor  reconstructor  is  to  be  minimized.  The  optimality  of  the  phasor  weights 
for  large  crx2  and  ay2  can  also  be  realized  by  noting  that  the  maximum  value  of  Figure  4.1, 
(~3.0),  is  smaller  than  the  maximum  value  of  Figure  3.7,  (~3.3).  For  Figure  5.2  and 
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Figure  5.3  the  linear  reconstructor  output  variance  was  found  according  to  the  method 
shown  in  Appendix  B. 


Phasor  Case  Non-Phasor  Case 


Figure  5.3.  Plot  shows  that  a  phasor-based  x-direction 
weight  can  be  found  which  is  more  optimal  than  the 
optimal  linear  x-direction  weight  for  a  given  input  noise 
variance.  For  this  plot,  ctx2=1.5  and  ay2=1.0. 


Figures  5.4  and  5.5  show  the  output  noise  variances  of  the  two  reconstructors  as 
the  x-direction  weight  is  varied.  For  the  case  in  Figure  5.4,  ax2  =  3.75  and  ay2  =  0.25. 
Once  again,  the  optimal  linear-based  x-direction  weight  is  not  the  same  as  the  optimal 
phasor-based  x-direction  weight.  For  the  case  of  Figure  5.5,  ctx2  =  3.75  and  cry2  =  3.75.  In 
this  case,  since  both  input  noise  variances  are  equal,  the  optimal  phasor-based  weight  is 
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output  noise  variance 


the  same  as  the  optimal  non-phasor-based  weight.  The  figure  also  shows  that  the  phasor 
case  is  much  more  robust  to  errors  in  input  noise  variance  than  is  the  non-phasor  case. 


Phasor  Case  Non-Phasor  Case 


wx 


Figure  5.4.  Plot  shows  the  difference  of  the  optimal  x- 
direction  weights  for  the  phasor  and  non-phasor 
reconstructors  for  ax2=  3 .75  and  ay2=0.25 . 
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wx 


Figure  5.5.  Plot  where  there  is  no 
difference  between  the  optimal 
phasor-based  x-direction  weight  and 
the  optimal  non-phasor-based  x- 
direction  weight  since  both  input 
noise  variances  are  equal.  In  this 
case  <tx2=  3.75  and  cry2=3.75. 

VI.  CONCLUSIONS  AND  FUTURE  WORK 
The  traditional  least  mean  square  (LMS)  phase  reconstructor  is  susceptible  to  2n 
phase  ambiguities  at  points  of  zero  phase  field  amplitude  when  applied  to  speckle  field 
patterns.  A  phasor-based  phase  reconstructor  has  been  proposed  in  hopes  of  eliminating 
this  problem.  The  goal  of  this  paper  was  to  analyze  the  output  noise  characteristics  of 
path  averaging  operations  in  a  phasor-based  phase  reconstructor.  An  analytic  technique 
was  described  for  finding  the  probability  density  function  (PDF)  and  the  variance  of  the 
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variances  were  the  input  parameters  to  the  PDF  algorithm.  The  resulting  PDF  was  used 
to  find  the  optimal  phasor-based  weights  in  order  to  minimize  the  output  noise  variance 
of  the  reconstructor.  The  robustness  of  the  phasor-based  reconstructor  to  errors  in  the 
input  noise  variance  approximations  was  studied.  These  studies  concluded  that,  in 
general,  the  phasor-based  reconstructor  produced  results  that  were  more  robust  to  errors 
in  input  noise  estimates  as  compared  to  the  common  LMS  reconstructor.  Also,  it  was 
shown  that  the  optimal  linear-based  weights  will  not  always  produce  the  minimum 
reconstructor  output  noise  gain  when  used  in  the  phasor-based  reconstructor.  For  this 
reason,  the  optimal  phasor-based  weights  should  always  be  used  when  using  the  phasor- 
based  reconstructor.  Future  work  should  include  investigating  the  effects  of  correlated 
input  noise  on  the  reconstructor  output  noise. 
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APPENDIX  A 


DERIVATION  OF  PDF  OF  W  SIN(T) 


The  PDF  of  equation  (Al)  is  to  be  found: 

U  =  G(s)  =  w-sin(T)  =  w-sin(s).  (Al) 

Since  U=w-sin(s)  is  a  nonmonotonic  function,  equation  (A2)  is  used  to  find  its 
PDF.  Where  each  of  the  sn’s  represent  a  solution  to  sn=  arcsin(U/w).  Since  equation  (Al) 
is  nonmonotonic,  there  are  an  infinite  number  of  sn  solutions. 


fs(so)  |  fs(si) 

uW  |d  G(s)  /  ds|s=s0  |d  G(s)  /  ds|s=sl 


(A2) 


where  fs(s)  = 


^/2-7t-(2-an2) 


exP(  (-s2)/(2-(2-Gn2))  ), 


and 


d  G(s) 


ds 


_  dw-sin(s) 


S=Sn 


ds 


s=s, 


=  w-cos(s0) 


=  a/w2-U2, 

d  G(s) 
ds 

=  w-cos(7t-sin‘1(u/w))  =  -  w-cos(s0) 


_  d  w-sin(s) 


s=s. 


ds 


s=s, 


=  W'COS(Sj) 


and  s0=sin'1(u/w),  s,=7t-s0,  s2=2-7t+s0, 


S3  3  *7C-S0 
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Now,  substituting  into  equation  (A2),  the  PDF  of  G(x)=w-sin(x)  is  found  to  be: 


APPENDIX  B 

FINDING  THE  OPTIMAL  LINEAR  WEIGHTS 
AND  OUTPUT  VARIANCE  FOR  THE  LINEAR 
RECONSTRUCTOR 

Equation  (Bl)  represents  the  noise  portion  of  equation  1.  The  first  two  terms  in 
equation  (Bl)  represent  the  x-direction  noise  and  the  last  two  terms  represent  the  y- 
direction  noise  terms. 


Now,  lefw]=w2=wx  and  w3=w4=wy, 

So  On=2-wx-  ^  iij  +  2-wy-  ^  tij 

li=l  J  Vj=l 

Therefore,  G&2  =  2-wx2-(2an2)+2-wy2-(2*an2), 

where  on2  is  the  variance  of  the  individual  noise  terms,  % 

Now,  assume  that  the  variance  of  the  individual  noise  terms  in  the  x-direction  is  ctx2  and 
the  individual  noise  terms  in  the  y-direction  have  variance  cy2. 

Then  a^2  =  2-wx2-(2ax2)+2-wy2-(2-cry2) 

=  4-(  wx2-cx2+wy2-CTy2). 
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Since  originally,  w,+w2+w3+W4=l,  we  now  have  the  constraint  wx+wy=0.5,  or 
wy=0.5-wx.  Now,  to  optimize  a02,  we  set: 

_  Q  _  d  [4-(wx2-ctx2+(0.5-wx)2-ctv2)] 

dwx  dwx 


=  4-(2-wx)-ctx2  +  (4>(-2>(0.5-wx>ay2  =  0. 


Therefore, 

wx,0pt  =  (^y2)/(2-(cjx2+ay2)) 

(B2) 

and 

wy>opt  =  (CTx2)/(2-(ax2+CTy2)). 

(B3) 

Thus, 

^2<t,opi  =  4-(w2xopt-ax2+w2yopt-ay2). 

(B4) 

Now,  substituting  equations  (B2)  and  (B3)  into  equation  (B4)  and  simplifying, 
CT^op,  is  found  to  be: 

2  _  gx/gy2_ 

CT  -**•  0X2+Oy2' 

This  is  the  output  noise  variance  of  the  linear  reconstructor  using  optimal  linear 
weights.  Equations  (B2)  and  (B3)  represent  the  optimal  linear  based  weights  for  the  x- 
and  y-direction  terms  in  the  reconstructor. 
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APPENDIX  C 


RECONSTRUCTOR  OUTPUT 
VARIANCE  DATA 


This  data  represents  the  output  noise  variance  of  the  phasor  reconstructor  for 


optimal  linear-based  weights  when  the  input  noise  variances  in  the  x-  and  y-directions 
vary  from  0.25  up  to  4.0.  This  data  is  plotted  in  Figure  3.7. 


Table  C.l  Phasor  reconstructor  with  optimal  linear  weights  for  ctx2  and  cry2  varied 

from  0.25  up  to  4.0. _ 

a2  0250  0500  0.750  lJOOO  1250  1500  1.750 


0250 

0.030 

0.068 

0.127 

0.208 

0.291 

0.356 

0.396 

0500 

0.068 

0.121 

0.220 

0.372 

0.530 

0.655 

0.732 

0.750 

0.127 

0.220 

0.422 

0.707 

0.981 

1.186 

1.309 

UJ00 

0.208 

0.372 

0.707 

1.130 

1.507 

1.776 

1.935 

1250 

0.291 

0.530 

0.981 

1.507 

1.953 

2.263 

2.444 

1500 

0.356 

0.655 

1.186 

1.776 

2.263 

2.596 

2.788 

1.750 

0.396 

0.732 

1.309 

1.935 

2.444 

2.788 

2.986 

2000 

0.416 

0.771 

1.371 

2.014 

2.533 

2.883 

3.083 

2250 

0.425 

0.787 

1.398 

2.048 

2.570 

2.923 

3.124 

2500 

0.428 

0.794 

1.408 

2.060 

2.584 

2.937 

3.139 

2.750 

0.430 

0.796 

1.411 

2.064 

2.588 

2.942 

3.143 

3000 

0.430 

0.797 

1.412 

2.065 

2.589 

2.943 

3.144 

3250 

0.431 

0.797 

1.412 

2.065 

2.589 

2.942 

3.144 

3500 

0.432 

0.798 

1.412 

2.064 

2.588 

2.941 

3.143 

3.750 

0.433 

0.798 

1.412 

2.064 

2.587 

2.940 

3.141 

4000 

0.434 

0.800 

1.413 

2.063 

2.586 

2.938 

3.139 

2JO00 

2250 

2.500 

2.750 

3JOOO 

3.250 

3500 

3.750 

4J000 

0.416 

0.425 

0.428 

0.430 

0.430 

0.431 

0.432 

0.433 

0.434 

0.771 

0.787 

0.794 

0.796 

0.797 

0.797 

0.798 

0.798 

0.800 

1.371 

1.398 

1.408 

1.411 

1.412 

1.412 

1.412 

1.412 

1.413 

2.014 

2.048 

2.060 

2.064 

2.065 

2.065 

2.064 

2.064 

2.063 

2.533 

2.570 

2.584 

2.588 

2.589 

2.589 

2.588 

2.587 

2.586 

2.883 

2.923 

2.937 

2.942 

2.943 

2.942 

2.941 

2.940 

2.938 

3.083 

3.124 

3.139 

3.143 

3.144 

3.144 

3.143 

3.141 

3.139 

3.181 

3.222 

3.237 

3.242 

3.243 

3.242 

3.241 

3.239 

3.237 

3.222 

3.264 

3.279 

3.284 

3.285 

3.284 

3.283 

3.281 

3.279 

3.237 

3.279 

3.294 

3.299 

3.300 

3.299 

3.298 

3.296 

3.294 

3.242 

3.284 

3.299 

3.304 

3.305 

3.304 

3.303 

3.301 

3.299 

3.243 

3.285 

3.300 

3.305 

3.305 

3.305 

3.304 

3.302 

3.300 

3.242 

3.284 

3.299 

3.304 

3.305 

3.304 

3.303 

3.301 

3.299 

3.241 

3.283 

3.298 

3.303 

3.304 

3.303 

3.302 

3.300 

3.298 

3.239 

3.281 

3.296 

3.301 

3.302 

3.301 

3.300 

3.298 

3.296 

3.237 

3.279 

3.294 

3.299 

3.300 

3.299 

3.298 

3.296 

3.294 

This  is  the  raw  data  from  the  ‘optphase.m’  m-file.  This  data  represents  the  output 
noise  variance  of  the  phasor  reconstructor  for  optimal  phasor  weights  when  the  input 
noise  variances  in  the  x-  and  y-directions  vary  from  0.25  up  to  4.0.  This  data  is  plotted  in 


Figure  4.1. 


Table  C.2  Phasor  reconstructor  with  optimal  phasor  weights  for  ax2  and  <jy2 
from  0.25  up  to  4.0.  _ 


0250 

0.500 

0.750 

UD00 

0.250 

0.115 

0.164 

0.186 

0.204 

0.500 

0.164 

0.293 

0.402 

0.493 

0.750 

0.186 

0.402 

0.607 

0.795 

1JD00 

0.204 

0.493 

0.795 

i.000 

1.250 

0.219 

0.541 

0.905 

1.214 

1.500 

0.225 

0.552 

0.975 

1.285 

1.750 

0.230 

0.600 

0.990 

1.397 

2J000 

0.231 

0.591 

1.056 

1.450 

2.250 

0.228 

0.611 

1.029 

1.473 

2.500 

0.234 

0.622 

1.083 

1.494 

2.750 

0.234 

0.616 

1.096 

1.524 

3JD00 

0.236 

0.625 

1.063 

1.473 

3.250 

0.234 

0.623 

1.086 

1.525 

3500 

0.236 

0.629 

1.105 

1.548 

3.750 

0.237 

0.620 

1.100 

1.544 

4JD00 

0.232 

0.622 

1.080 

1.499 

1.250 

1300 

1.750 

2J000 

2250 

0.219 

0.225 

0.230 

0.231 

0.228 

0.541 

0.552 

0.600 

0.591 

0.611 

0.905 

0.975 

0.990 

1.056 

1.029 

1.214 

1.285 

1.397 

1.450 

1.473 

1.394 

1.607 

1.716 

1.788 

1.827 

1.607 

1.749 

1.943 

2.043 

2.040 

1.716 

1.943 

2.051 

2.232 

2.301 

1.788 

2.043 

2.232 

2.299 

2.463 

1.827 

2.040 

2.301 

2.463 

2.500 

1.853 

2.138 

2.342 

2.526 

2.638 

1.886 

2.167 

2.392 

2.562 

2.603 

1.903 

2.177 

2.421 

2.525 

2.731 

1.883 

2.176 

2.348 

2.611 

2.748 

1.902 

2.148 

2.432 

2.608 

2.758 

1.845 

2.193 

2.456 

2.611 

2.775 

1.924 

2.218 

2.462 

2.631 

2.718 

2500 

2.750 

3JOOO 

3.250 

3500 

0.234 

0.234 

0.236 

0.234 

0.236 

0.622 

0.616 

0.625 

0.623 

0.629 

1.083 

1.096 

1.063 

1.086 

1.105 

1.494 

1.524 

1.473 

1.525 

1.548 

1.853 

1.886 

1.903 

1.883 

1.902 

2.132 

2.167 

2.177 

2.176 

2.148 

2.342 

2.392 

2.421 

2.348 

2.432 

2.526 

2.562 

2.525 

2.611 

2.608 

2.638 

2.605 

2.731 

2.748 

2.758 

2.661 

2.792 

2.833 

2.857 

2.878 

2.792 

2.788 

2.907 

2.940 

2.888 

2.833 

2.907 

2.888 

2.925 

3.013 

2.857 

2.940 

2.925 

2.967 

3.063 

2.878 

2.888 

3.013 

3.063 

3.029 

2.819 

2.970 

3.042 

3.090 

3.125 

2.901 

2.974 

3.053 

3.105 

3.140 

varied 


3.750  4JOOO 


0.237  0.232 

0.620  0.622 
1.100  1.080 
1.544  1.499 

1.845  1.924 

2.193  2.218 

2.456  2.462 

2.611  2.631 

2.775  2.718 

2.819  2.901 

2.970  2.974 

3.042  3.053 

3.090  3.105 

3.125  3.140 

3.078  3.153 

3.153  3.115 
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This  is  the  raw  data  for  the  plot  shown  in  Figure  4.2.  This  data  represents  the 
difference  of  the  output  noise  variance  of  the  phasor  reconstructor  using  optimal  linear- 
based  weights  minus  the  output  noise  variance  of  the  phasor  reconstructor  using  optimal 
phasor-based  weights. 


Table  C.3  Difference  of  the  data  in  Table  C.l  minus  data  in  Table  C.2. 


0250  0.500  0.750  1JJ00  1JD25  1300  1.750  2JD0O  2250  2300  2.750  3JOOO  3250  3300  3.750  4.000 


0250 

-0585 

-0596 

-0558 

0.005 

0300 

-0.096 

-0.172 

-0.181 

-0.121 

0.750 

-0.058 

-0.181 

-0.185 

-0.088 

1.000 

0.005 

-0.121 

-0.088 

0.130 

1250 

0.073 

-0511 

0.076 

0.293 

1300 

0.131 

0.102 

0.211 

0.492 

1.750 

0.166 

0.132 

0.319 

0.538 

2JD00 

0.186 

0.179 

0.315 

0.564 

2250 

0.197 

0.177 

0.369 

0.575 

2500 

0.195 

0.172 

0.325 

0.566 

2.750 

0.196 

0.180 

0.315 

0.540 

3500 

0.194 

0.172 

0.349 

0.592 

3250 

0.197 

0.174 

0.326 

0.540 

3500 

0.196 

0.169 

0.307 

0.517 

3.750 

0.196 

0.178 

0.313 

0.520 

4500 

0.202 

0.177 

0.333 

0.564 

0.073 

0.131 

0.166 

0.186 

0.197 

-0511 

0.102 

0.132 

0.179 

0.177 

0.076 

0.211 

0.319 

0.315 

0.369 

0.293 

0.492 

0.538 

0.564 

0.575 

0.559 

0.656 

0.727 

0.745 

0.744 

0.656 

0.848 

0.845 

0.840 

0.883 

0.727 

0.845 

0.935 

0.850 

0.823 

0.745 

0.840 

0.850 

0.881 

0.759 

0.744 

0.883 

0.823 

0.759 

0.763 

0.731 

0.799 

0.797 

0.711 

0.641 

0.702 

0.775 

0.751 

0.680 

0.681 

0.686 

0.765 

0.723 

0.718 

0.553 

0.706 

0.766 

0.796 

0.632 

0.536 

0.687 

0.793 

0.710 

0.633 

0.525 

0.742 

0.747 

0.685 

0.628 

0.506 

0.663 

0.720 

0.677 

0.607 

0.561 

0.195 

0.196 

0.194 

0.197 

0.196 

0.172 

0.180 

0.172 

0.174 

0.169 

0.325 

0.315 

0.349 

0.326 

0.307 

0.566 

0.540 

0.592 

0.540 

0.517 

0.731 

0.702 

0.686 

0.706 

0.687 

0.799 

0.775 

0.765 

0.766 

0.793 

0.797 

0.751 

0.723 

0.796 

0.710 

0.711 

0.680 

0.718 

0.632 

0.633 

0.641 

0.679 

0.553 

0.536 

0.525 

0.633 

0.507 

0.467 

0.442 

0.420 

0.507 

0.516 

0.397 

0.364 

0.415 

0.467 

0.397 

0.417 

0.380 

0.290 

0.442 

0.364 

0.380 

0.337 

0.240 

0.420 

0.415 

0.290 

0.240 

0.273 

0.477 

0.331 

0.260 

0.211 

0.175 

0.393 

0.325 

0.246 

0.194 

0.157 

0.196 

0.202 

0.178 

0.177 

0.313 

0.333 

0.520 

0.564 

0.742 

0.663 

0.747 

0.720 

0.685 

0.677 

0.628 

0.607 

0.506 

0.561 

0.477 

0.393 

0.331 

0.325 

0.260 

0.246 

0.211 

0.194 

0.175 

0.157 

0.221 

0.143 

0.143 

0.179 

APPENDIX  D 

OPTIMAL  X-DIRECTION  WEIGHTS 

D.l  Optimal  Phasor-Based  X-Direction  Weights 
This  data  represents  the  optimal  phasor-based  x-direction  weights  for  input  noise 
variances  in  both  the  x-  and  y-directions  varying  from  0.25  up  to  4.0.  The  optimal 
phasor-based  y-direction  weights  for  a  given  x-  and  y-direction  input  noise  variance  can 
be  found  by  subtracting  the  corresponding  x-direction  weight  from  0.5. 
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Table  D.  1  Optimal  phasor-based  x-direction  weights. 

O,1  0250  0500  0.750  1JOOO  1250  1  500  1.750 


* 

0250 

0.250 

0.333 

0.375 

0.400 

0.427 

0.449 

0.458 

0500 

0.167 

0.250 

0.300 

0.343 

0.377 

0.395 

0.419 

0.750 

0.125 

0.200 

0.250 

0.296 

0.333 

0.363 

0.380 

1000 

0.100 

0.157 

0.204 

0.250 

0.288 

0.320 

0.358 

1250 

0.073 

0.123 

0.168 

0.212 

0.250 

0.283 

0.322 

1500 

0.051 

0.105 

0.137 

0.180 

0.217 

0.250 

0.289 

1.750 

0.043 

0.081 

0.120 

0.142 

0.178 

0.211 

0.250 

2000 

0.036 

0.060 

0.086 

0.117 

0.152 

0.184 

0.213 

2250 

0.020 

0.041 

0.075 

0.104 

0.129 

0.160 

0.179 

2500 

0.016 

0.033 

0.065 

0.093 

0.117 

0.138 

0.156 

2.750 

0.022 

0.027 

0.057 

0.083 

0.106 

0.127 

0.144 

3000 

0.019 

0.021 

0.050 

0.075 

0.097 

0.117 

0.134 

3250 

0.006 

0.017 

0.044 

0.068 

0.089 

0.108 

0.125 

3500 

0.003 

0.023 

0.038 

0.061 

0.082 

0.100 

0.117 

3.750 

0.011 

0.009 

0.033 

0.055 

0.075 

0.093 

0.109 

4000 

>0.001 

0.006 

0.029 

0.050 

0.069 

0.086 

0.102 

2D  00 

2.250 

2500 

2.750 

3D  OO 

3250 

3500 

3.750 

4D00 

0.464 

0.480 

0.485 

0.478 

0.482 

0.494 

0.497 

0.489 

0.501 

0.440 

0.459 

0.467 

0.473 

0.479 

0.483 

0.488 

0.491 

0.494 

0.414 

0.425 

0.435 

0.443 

0.450 

0.456 

0.462 

0.467 

0.471 

0.383 

0.396 

0.407 

0.417 

0.425 

0.432 

0.439 

0.445 

0.450 

0.348 

0.371 

0.383 

0.394 

0.403 

0.411 

0.418 

0.425 

0.431 

0.316 

0.340 

0.363 

0.374 

0.383 

0.392 

0.400 

0.407 

0.414 

0.287 

0.321 

0.344 

0.356 

0.366 

0.375 

0.383 

0.391 

0.398 

0.250 

0.285 

0.308 

0.340 

0.350 

0.360 

0.368 

0.376 

0.383 

0.215 

0.250 

0.303 

0.285 

0.336 

0.346 

0.354 

0.363 

0.370 

0.192 

0.197 

0.250 

0.272 

0.313 

0.333 

0.342 

0.350 

0.358 

0.161 

0.185 

0.228 

0.250 

0.271 

0.301 

0.320 

0.339 

0.346 

0.150 

0.164 

0.187 

0.229 

0.250 

0.280 

0.319 

0.328 

0.336 

0.141 

0.155 

0.167 

0.199 

0.220 

0.250 

0.309 

0.298 

0.316 

0.132 

0.146 

0.158 

0.180 

0.181 

0.201 

0.250 

0.299 

0.307 

0.124 

0.138 

0.150 

0.162 

0.172 

0.202 

0.201 

0.250 

0.308 

0.117 

0.130 

0.142 

0.154 

0.164 

0.184 

0.193 

0.192 

0.240 

D.2  Optimal  Linear-Based  X-Direction  Weights 
This  data  represents  the  optimal  linear-based  x-direction  weights  for  input  noise 
variances  in  both  the  x-  and  y-directions  varying  from  0.25  up  to  4.0.  The  optimal  linear- 
based  y-direction  weights  for  a  given  x-  and  y-direction  input  noise  variance  can  be  found 
by  subtracting  the  corresponding  x-direction  weight  from  0.5. 


Table  D.2  Optimal  linear-based  x-direction  weights. 


0.250  0500  0.750  1D00  1250  1500  1.750  2000  2250  2500  2.750  3DOO  3250  3500  3J50  4JOOO 


0250 

0.250 

0.333 

0.375 

0500 

0.167 

0.250 

0.300 

0.750 

0.125 

0.200 

0.250 

1D00 

0.100 

0.167 

0.214 

1250 

0.083 

0.143 

0.188 

1500 

0.071 

0.125 

0.167 

1.750 

0.063 

0.111 

0.150 

2D00 

0.056 

0.100 

0.136 

2250 

0.050 

0.091 

0.125 

2500 

0.046 

0.083 

0.115 

2.750 

0.042 

0.077 

0.107 

3D00 

0.039 

0.071 

0.100 

3250 

0.036 

0.067 

0.094 

3500 

0.033 

0.063 

0.088 

3.750 

0.031 

0.059 

0.083 

4D00 

0.029 

0.056 

0.079 

0.400 

0.417 

0.429 

0.438 

0.333 

0.357 

0.375 

0.389 

0.286 

0.313 

0.333 

0.350 

0.250 

0.278 

0.300 

0.318 

0.222 

0.250 

0.273 

0.292 

0.200 

0.227 

0.250 

0.269 

0.182 

0.208 

0.231 

0.250 

0.167 

0.192 

0.214 

0.233 

0.154 

0.179 

0.200 

0.219 

0.143 

0.167 

0.188 

0.206 

0.133 

0.156 

0.177 

0.194 

0.125 

0.147 

0.167 

0.184 

0.118 

0.139 

0.158 

0.175 

0.111 

0.132 

0.150 

0.167 

0.105 

0.125 

0.143 

0.159 

0.100 

0.119 

0.136 

0.152 

0.444 

0.450 

0.455 

0.458 

0.400 

0.409 

0.417 

0.423 

0.364 

0.375 

0.385 

0.393 

0.333 

0.346 

0.357 

0.367 

0.308 

0.321 

0.333 

0.344 

0.286 

0.300 

0.313 

0.324 

0.267 

0.281 

0.294 

0.306 

0.250 

0.265 

0.278 

0.290 

0.235 

0.250 

0.263 

0.275 

0.222 

0.237 

0.250 

0.262 

0.211 

0.225 

0.238 

0.250 

0.200 

0.214 

0.227 

0.239 

0.191 

0.205 

0.217 

0.229 

0.182 

0.196 

0.208 

0.220 

0.174 

0.188 

0.200 

0.212 

0.167 

0.180 

0.192 

0.204 

0.462 

0.464 

0.467 

0.469 

0.429 

0.433 

0.438 

0.441 

0.400 

0.406 

0.412 

0.417 

0.375 

0.382 

0.389 

0.395 

0.353 

0.361 

0.368 

0.375 

0.333 

0.342 

0.350 

0.357 

0.316 

0.325 

0.333 

0.341 

0.300 

0.310 

0.318 

0.326 

0.286 

0.296 

0.304 

0.313 

0.273 

0.283 

0.292 

0.300 

0.261 

0.271 

0.280 

0.289 

0.250 

0.260 

0.269 

0.278 

0.240 

0.250 

0.259 

0.268 

0.231 

0.241 

0.250 

0.259 

0.222 

0.232 

0.241 

0.250 

0.214 

0.224 

0.233 

0.242 

0.471 

0.444 

0.421 

0.400 

0.381 

0.364 

0.348 

0.333 

0.320 

0.308 

0.296 

0.286 

0.276 

0.267 

0.258 

0.250 


8-36 


Mathematical  Modeling  of  Thermionic- AMTEC  Cascade  System  for  Space  Power 


M.A.K.  Lodhi 
Professor 

Department  of  Physics 


Texas  Tech  University 
Lubbock,  TX  79409 


Final  Report  for: 

Summer  Research  Extension  Program 
Phillips  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base,  DC 

and 

Phillips  Laboratory 


October  1996 


9-1 


MATHEMATICAL  MODELING  OF  THERMIONIC-AMTEC 
CASCADE  SYSTEM  FOR  SPACE  POWER 


M.  A.  K.  Lodhi 
Professor 

Department  of  Physics 
Texas  Tech  Universty 

Abstract 

A  mathematical  modeling  of  the  function  of  a  system  consisting  of  a  cascade  of  a  thermionic  energy  conversion  (TEEC) 
device  and  an  alkali  metal  thermal  to  electrical  conversion  (AMTEC)  device  has  been  performed.  The  AMTEC  utilizes 
this  reject  heat  of  the  TIEC.  A  mathematical  thermal  model  of  the  cascade  converter  has  been  developed  to  analyze 
effects  of  key  parameters  such  as  power  level,  heat  fluxes,  temperatures,  cascade  geometry,  etc.  In  this  effort,  a  9-node 
system  of  nonlinear  simultaneous  equations  has  been  constructed  which  is  solved  by  MATHCAD  predicting  the 
temperatures  of  the  principal  components  and  the  heat  flow.  Through  this  study,  a  better  understanding  of  the  thermal 
coupling  of  the  two  converters  was  gained  which  helps  to  produce  a  more  efficient  cascade. 
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MATHEMATICAL  MODELING  OF  THERMIONIC-AMTEC 
CASCADE  SYSTEM  FOR  SPACE  POWER 

M.  A.  ICLodhi 


Introduction 


The  purpose  of  this  study  is  to  investigate  the  compatibility  of  such  a  tandem  device  which  cascades  two  complementary 
static  devices  and  to  develop  a  mathematical  model  for  matching  the  thermal  impedances  of  the  two  converters  resulting 
in  a  more  efficient  cascade. 

The  cascade  consists  of  a  thermionic  energy  converter  (TIEC)  and  an  alkali  metal  thermoelectric  converter  (AMTEC). 
The  reason  for  selecting  these  converters  is  for  the  temperature  at  which  the  TIEC  rejects  its  heat  is  in  the  range  of  the 
temperature  suitable  for  the  AMTEC  to  receive  the  heat  With  little  modification,  a  TIEC  can  tandem  with  AMTEC  to 
make  a  cascade  device.  To  better  describe  our  research,  we  will  give  first  brief  descriptions  of  the  TIEC,  the  AMTEC, 
and  the  cascade. 

The  goal  for  static  conversion  systems  has  been  to  identify  static  converters  with  efficiencies  that  are  competitive  with 
dynamic  system.  Alternatively  two  compatible  static  devices  may  be  linked  together  in  a  tandem  form  to  increase  the 
efficiency  for  space  power  system.  The  purpose  of  this  research  is  to  investigate  into  the  compatibility  of  such  a  tandem 
device  which  cascades  two  complementary  static  devices  and  develops  a  mathematical  model  to  help  the  designing  of 
the  matching  of  the  thermal  impedances  of  the  two  converters  resulting  in  a  more  efficient  cascade.  The  cascade  under 
investigation  consists  of  a  thermionic  energy  converter  (TIEC)  and  an  alkali  metal  thermoelectric  converter  (AMTEC). 
Fortuitously  the  temperature  at  which  TIEC  rejects  its  heat  is  in  the  range  of  the  temperature  suitable  for  AMTEC  to 
receive  the  heat  With  little  modification  a  TIEC  can  tandem  with  AMTEC  to  make  a  cascade  device. 

A  thermionic  converter  is  a  static  device  which  converts  heat  energy  into  electricity  through  the  surface  emission  of 
electrons.  Fig.  1  gives  a  schematic  diagram  of  parts  and  working  of  a  typical  TIEC.  It  consists  of  an  emitter  (or 
cathode)  which  at  one  surface  receives  heat  from  a  suitable  source  and  emits  electrons  from  the  other.  A  collector  (or 
anode)  collects  these  electrons  and  is  cooled  to  a  lower  temperature  than  the  emitter  to  limit  the  back  emission  of 
electrons.  The  anode  and  cathode  are  connected  by  electrical  leads  to  supply  this  generated  power  to  the  load. 

The  rate  of  the  electron  emission  from  the  hot  metal  surface  is  a  function  of  metal  temperature  and  the  work  function. 
The  kinetic  energy  of  electrons  within  the  bulk  metal  increases  with  increasing  temperature  allowing  a  higher  fraction  of 
them  to  escape  from  the  metal  surface. 

To  reduce  the  space  charge  effect  the  gap  between  the  emitter  and  collector  is  filled  with  plasma  of  cesium  vapor. 
Cesium  has  the  lowest  ionization  potential  and  its  absorption  reduces  the  surface  work  function  of  the  electrodes  lower 
than  liquid  cesium  or  any  metal  [1].  The  converter  stably  generated  power  at  80kW/m2  (8W/cm2)  with  an  electrode 
efficiency  of  1 7%  for  5  years  in  the  lab  without  a  failure  [2].  The  typical  temperature  of  TIEC  working  are  1 600  to 
2000K  at  the  receiving  end  of  the  heat  and  800-1 100K  at  the  rejection  end  [3]. 

AMTEC  is  a  relatively  new  type  of  device  based  on  the  principle  of  concentration  cell  conceived  in  late  sixties  [4,5]  as 
illustrated  in  Fig.  2.  A  closed  vessel  is  divided  into  a  high  temperature,  T2,  and  high  pressure  region  contact  with  a  heat 
source  and  low  temperature,  T\  and  low  pressure  region  in  contact  with  a  heat  sink.  These  regions  are  separated  by  a 
barrier  of  beta-alumina  solid  electrolyte  (BASE)  which  has  ionic  conductivity  much  larger  than  the  electronic 
conductivity.  The  BASE  bears  a  porous  metal  electrode  (anode)  which  converts  the  low  pressure  surface  of  the  BASE. 
A  closed  container  is  partially  filled  with  liquid  sodium  working  fluid.  Sodium  ions  diffuse  through  the  BASE  in 
response  to  the  pressure  differential  (gradient  of  Gibbs  free  energy).  Electrical  leads  are  connected  with  the  porous 
anode  and  with  the  high  temperature  liquid  sodium  as  cathode.  When  the  circuit  is  closed  electrons  flow  to  the  porous 
anode  surface  through  the  load  producing  electrical  work.  A  return  line  and  electromagnetic  pump  or  a  wick  circulate 
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the  sodium  work  fluid  through  AMTEC  [4-7],  AMTEC  conversions  efficiency  was  reported  19%  in  1981  [8]  and  a 
power  density  of  1  W/cmJ  [6].  It  hot  region  temperature  is  in  the  range  of  900- 1 30k  [9] ,  quite  compatible  with  the 
general  purpose  heat  source  (GPHS)  [10]  or  the  projected  Sp-100  nuclear  space  power  reactor  [11],  AMTEC  is  limited 
to  1 350K  because  of  the  strength  of  the  BASE.  The  lower  temperature  region  could  be  limited  to  600K  as  the  condenser 
temperature  increases  above  700K  the  efficiency  decreases  quite  rapidly  [1],  AMTEC  has  high  efficiency  near  Carnot 
[12],  low  maintenance  and  high  durability,  modular  construction  and  the  ability  to  use  high  temperature  combustion 
nuclear  or  solar  heat  source  [13], 

Cascade  Assembly 

An  assembly  of  TEC  and  AMTEC  has  been  cascaded  with  a  transition  piece  between  the  collector  of  TEC  and 
AMTEC.  The  transition  piece  is  surrounded  by  a  trim  collar  which  is  heated  at  a  desirable  temperature  by  a  heating 
element.  The  trim  heater  is  surrounded  by  a  calorimeter.  The  cascade  is  schematically  shown  in  Fig.  3.  The  TEC  is 
heated  by  electron  bombardment  portion  of  this  heat  energy  into  electricity  and  rejects  the  rest  from  the  lower  end  of  the 
emitter.  The  rejected  heat  is  picked  up  by  AMTEC  via  the  transition  piece  and  converted  into  electricity. 

At  the  outset  the  cascade  was  broken  down  mathematically  into  as  many  nodes  as  necessary  for  the  complete  analysis  of 
the  converter.  From  the  conservation  of  energy,  the  energy  balance  equations  were  developed  at  each  node.  We 
determined  nine  equations  were  adequate.  These  equations  are  nonlinear  in  temperature  and  were  solved  with  the 
boundary  conditions  determined  at  their  respective  notes.  For  solving  this  set  of  equations  a  software  called 
MATHCAD  was  used. 

Cascade  Modeling 

For  the  purpose  of  keeping  the  track  of  the  heat  and  mass  flow  the  cascade  has  been  broken  down  into  nine  different 
nodes.  At  each  note  the  boundary  conditions  are  determined  for  the  heat  and  mass  flow  through  the  cascade.  This  gives 
rise  to  nine  simultaneous  equations  with  nine  temperatures  at  these  nodes  to  be  solved.  With  the  help  of  the  nodal  circuit 
the  nine  equations  are  developed  and  solved  for  those  temperatures.  In  order  to  solve  these  lengthy  and  complicated 
equations  and  iterative  procedure  is  adopted  with  MATHCAD  5.0.  For  the  iterative  solution  some  initial  values  of  the 
unknowns  are  guessed  and  supplied  for  the  calculations.  The  final  answers  do  not  depend  on  the  initial  values  supplied. 
The  answers  do  depend  on  the  parameters  used  for  material  properties  and  radiation  loss  or  heat  transfer  terms.  The 
radiation  terms  are  in  turn  dependent  on  view  factors  of  the  geometry  of  the  cascade. 

Before  solving  the  nine  nodal  equations  it  is  essential  to  estimate  the  view  factors  at  each  node  as  close  as  possible  to  the 
geometry  of  the  cascade.  This  is  not  a  trivial  job.  Various  view  factors  have  been  evaluated  and  substituted  in  the 
appropriate  places  in  the  nodal  equations  with  corresponding  portions  of  the  areas  of  the  various  components  of  the 
cascade.  The  details  of  calculation  of  the  view  factors  is  avoided  here  to  be  presented  for  the  sake  of  brevity.  The 
development  of  these  expressions  is  given  as  the  input  and  output  data  of  the  MATCHED  compilation  at  the  outset  of 
the  nine  simultaneous  nodal  equations  and  their  solutions. 

The  model  consists  of  nine  nodal  equations  (using  conservation  of  energy)  at  strategic  locations  within  the  cascade.  The 
nodal  points  selected  on  the  cascade  system  were  at  the  emitter,  flange,  bellows,  collector,  transition  piece,  trim  collar, 
AMTEC  evaporator  (in  the  high  temperature  zone),  AMTEC  condenser  (in  the  low  temperature  zone),  and  the 
calorimeter  collar. 

Because  the  cascade  is  to  be  used  in  a  vacuum  environment,  convection  heat  transfer  was  assumed  to  be  non-existent  in 
the  thermal  model.  Therefore,  the  only  heat  transfer  modes  considered  in  the  thermal  model  were  conduction  and 
radiation.  Since  the  solid  components  of  the  cascade  were  assumed  to  be  at  uniform  temperature,  the  only  conduction 
applied  in  the  thermal  modeling  was  from  component  to  component  across  contact  interfaces.  The  heat  transfer  across 
contact  interfaces  was  actually  a  combination  of  conduction  and  radiation.  Since  perfect  contact  between  components  is 
virtually  impossible,  gaps  exist  in  which  the  only  possible  avenue  for  heat  transfer  is  radiation.  The  lack  of  a  fluid 
medium  to  act  as  an  energy  transfer  mechanism  within  the  contact  gaps  of  the  interfaces  increases  the  thermal  resistance. 
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The  interfacial  heat  transfer  between  the  various  cascade  nodal  sections  was  therefore  calculated  by  assuming  the  heat 
flow  was  equal  to  the  temperature  difference  across  the  interface  divided  by  the  thermal  interfacial  resistance.  The 
thermal  interfacial  resistance  was  approximated,  and  it  is  therefore  a  candidate  for  further  investigation. 

Due  to  the  high  temperatures  within  the  device,  radiation  heat  transfer  was  a  major  heat  transfer  mechanism.  A  major 
challenge  involved  in  calculating  radiation  heat  transfer  is  the  calculation  of  the  view  factors.  The  radiation  view  factor 
is  defined  as  the  fraction  of  the  diffuse,  uniform  radiant  energy  leaving  a  surface  that  is  incident  on  another  surface.  The 
view  factors  are  a  function  of  geometry  only,  due  to  the  assumption  of  diffuse,  uniform  radiation.  This  analysis  utilized 
two  geometrically  calculated  view  factors  which  included  parallel  disks  and  cylinder  within  cylinder  [14].  The  actual 
view  factors  were  then  derived  from  these  using  view  factor  algebra  and  enclosure  analysis. 


Results 

Calculational  results  from  this  model  with  a  heat  input  of  3 1 8  W  for  the  electron  gun  and  1 26  W  for  the  trim  heater  are 
given  in  Table  1 ,  followed  by  experimental  results.  The  size  of  the  error  vector  for  the  solution  given  in  Table  1  was 
given  by  MATHCAD  as  0.004. 


TABLE  1 .  Solution  to  Cascade  Nodal  Equations. 


Temp 

E3 JM 

mm 

EK3B 

E3E3I 

ESE3B 

E3551 

EHS 1 

mm 

Calc 

1944 

1264 

1318 

1054 

1229 

11% 

1310 

364 

928 

E9«i 

1801 

1041 

1110 

NR 

933 

875 

NR 

347 

1052 

Discussion 


In  general  the  main  causes  for  the  overprediction  temperature  are  the  boundary  conditions  and  the  assumed  values  of 
thermal  resistance  at  the  various  material  interfaces.  All  boundaries  were  assumed  to  be  adiabatic,  with  the  exception  of 
the  AMTEC  condenser,  which  was  assumed  to  be  in  contact  with  a  constant  temperature  (300  K)  heat  sink.  In  reality, 
none  of  the  boundaries  are  adiabatic,  they  radiate  heat  to  the  vacuum  chamber  walls.  The  only  parts  of  the  device  where 
the  adiabatic  assumption  is  far  from  reality  are  around  the  heaters.  The  electron  beam  heater,  while  shielded  by 
insulation,  appears  to  have  radiated  a  large  amount  of  heat  to  the  chamber.  More  significantly,  this  heater  appears  to 
have  radiated  a  great  deal  of  heat  to  the  calorimeter  (around  the  transition  piece),  causing  it  to  operate  at  a  higher 
temperature  than  predicted.  The  calorimeter  itself  took  out  100  watts  more  than  the  trim  heater  put  in,  due  to  minimum 
flow  requirements  needed  to  avoid  boiling  in  the  cooling  lines.  This  heat  loss,  coupled  with  higher  than  expected 
thermal  resistances,  led  to  the  AMTEC  cell  operating  at  a  much  lower  temperature  than  predicted.  We  believe  that  most 
of  the  thermal  resistance  is  due  to  gaps  opening  up  between  the  collector  and  the  transition  piece,  due  to  thermal 
expansion.  The  copper  plate  attached  to  the  AMTEC  condenser  actually  removed  50  watts  less  than  the  model 
predicted,  but  we  still  had  some  trouble  avoiding  boiling  water  in  the  cooling  lines. 


Conclusion 


The  main  purpose  of  this  study  was  to  understand  the  heat  flow  through  the  cascade  at  various  points,  which  has  been 
achieved  thus  providing  a  test  for  the  model.  The  accuracy  of  this  model  depends  on  a  number  of  parameters.  Some 
discrepancies  noted  between  the  predicted  and  observed  temperatures  in  Table  I  are  attributed  to:  (a)  Material 
properties.  These  properties  were  taken  from  the  catalogs  of  the  material  properties  of  standard  samples  at  various 
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temperature  and  pressure  rather  than  measuring  them  actually,  and  using  them  for  our  model. .  (b)  Thermal  contact 
resistances.  A  good  estimate  of  thermal  contact  resistances  between  various  components  of  TIEC,  AMTEC,  and  other 
ancillary  components  is  necessary  at  the  projected  temperatures  and  pressures.  AJthough  this  information  has  been 
incorporated  as  accurately  as  possible  in  the  model;  it  is,  however,  a  serious  source  of  error.  ©  Radiation  heat  transfer 
(particularly  losses).  This  factor  is  one  of  the  most  important  for  the  model  to  predict  the  temperatures.  For  this  reason, 
the  cascade  geometry  has  been  studied  very  carefully.  Its  geometrical  parameters  affect  the  configuration  (or  view) 
factors  for  the  various  components  of  the  cascade.  These  quantities  have  been  calculated  with  mathematical  exactitude, 
though  involving  some  tedious  and  laborious  algebra.  This  is  one  of  the  least  sources  of  error,  but  the  temperature  terms 
arising  from  the  radiation  losses  vary  as  the  fourth  power  of  temperature  and  thus  could  cause  a  large  error  for  small 
deviation  in  temperatures,  (d)  Nodal  analysis  of  the  cascade:  At  the  outset,  the  cascade  was  broken  down 
mathematically  into  as  many  nodes  as  necessary  for  the  complete  analysis  of  the  converter.  In  this  case  there  are  nine 
nodes.  From  conservation  of  energy,  the  energy  balance  equations  have  been  developed  at  each  node.  These  equations 
are  nonlinear  in  temperature  and  have  been  solved  with  the  boundary  conditions  determined  at  the  respective  nodes. 
Further  increasing  of  the  number  of  nodal  points  would  yield  better  agreement  at  intermediate  points. 

For  solving  this  set  of  equations  a  computer  model  was  developed  for  a  software  called  MATHCAD.  As  a  sample 
calculation  and  a  test  of  the  model,  a  temperature  profile  for  the  cascade  nodal  points  was  obtained  for  a  set  of  input  energy 
parameters  at  the  emitter  and  the  trim  heater. 

This  work  was  supported  in  part  by  the  Air  Force  Office  of  Scientific  Research  (AFOSR)  and  Phillips  Laboratory  Kirtland 
AFB.NM. 
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H  H  H  H  H  H  H  H 


Nomenclature 


The  temperature  of  the  emitter  (K) 

The  temperature  of  the  collector  (K) 

The  temperature  of  the  flange  (K) 

The  temperature  of  the  bellow  (K) 

The  temperature  of  the  transition  piece  (K) 

The  temperature  of  the  trim  collar  (heater)  (K) 

The  temperature  of  the  AMTEC  condenser  (K) 

The  temperature  of  the  calorimeter  (K) 

Q^:  The  heat  input  from  the  electron  gun  into  the  emitter  (W) 

Q trior  The  heat  input  from  trim  heater  into  transition  piece  (K) 
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EMITTER 

(CATHODE) 

EMITTER 
HEAT  CHOKE 


INSULATOR 

COLLECTOR 


[ilil 


1 


Heat 

Input 


\  =  900K  -  1300K 


s  .Te 


\  \  \  \ 


Sodium 

Vapor 


Beta 

Alumina 

Porous 

Electrode 


f  Liquid 
Sodiuml 


V 


_  J 

EM 

Pump 


I  vL,Heat 
l_J  Output 


T|  =  400K  -  800K 
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oni  ict 
3  si:  tanca 


pftter  sleeve,  Mo/Re 
Support  rod.  Mo 


Bellows,  Ni 

_  <°y 

Insulator,  alumina 

Transition  Piece,  Nl 

I  Collector,  Nl  /  ^  Trim  collar,  Ni 


Calorimeter  collar.  Nl 


AMTEC  condenser 


SCHEMATIC  OF  THERMAL  MODEL  OF  CASCADE  TEC/ AMTEC  CONVERTER 
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INCORPORATION  OF  BOUNDARY  CONDITION  MODELS  INTO  THE  AIR  FORCE 
COMPUTER  SIMULATION  PROGRAM,  MACH 


Carlos  A.  Ordonez 
Assistant  Professor 
Department  of  Physics 
University  of  North  Texas 


Abstract 

A  set  of  boundary  condition  models  have  been  developed  for  the  Air  Force  computer 
simulation  program,  MACH.  This  culminates  an  important  effort  carried  out  by  the 
principle  investigator.  The  models  are  based  on  a  fully-kinetic  self-consistent  theory  de¬ 
scribing  the  plasma  sheath  under  conditions  of  space-charge  saturation.  The  phase-space 
distribution  functions  for  each  species  of  particles  are  evaluated  and  velocity  moments 
are  taken  in  order  to  obtain  particle  densities,  fluxes,  temperatures,  and  energy  fluxes. 
The  electric  potential  profile  is  determined  self-consistently  for  three  different  types  of 
profiles.  These  are  a  monotonically-decreasing  potential,  a  single-minimum  potential 
associated  with  a  negative  surface  floating  potential,  and  a  single-minimum  potential 
associated  with  a  positive  surface  floating  potential.  The  electron  emission  requirements 
for  production  of  the  third  potential  profile  are  found  to  be  severe.  Particle  and  energy 
transport  relations  are  provided  only  for  the  first  two  of  the  three  potential  profiles. 
These  relations  are  supplied  for  three  separate  species:  plasma  electrons,  plasma  ions, 
and  surface-emitted  electrons.  At  the  present  time,  these  boundary  condition  models 
are  being  extended  for  current-carrying  plasma-facing  surfaces.  It  was  felt  that  this  ex¬ 
tension  was  sufficiently  important  to  devote  the  time  and  resources  allocated  toward  the 
project  to  this  effort. 
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INCORPORATION  OF  BOUNDARY  CONDITION  MODELS  INTO  THE  AIR  FORCE 
COMPUTER  SIMULATION  PROGRAM,  MACH 

Carlos  A.  Ordonez 


I.  Introduction 

Computer  programs,  such  as  MACH,  which  simulate  plasma  processes  often  re¬ 
quire  boundary  conditions  in  the  form  of  particle  and  energy  transport  relations  at 
plasma/ surf  ace  interfaces.  In  this  report,  boundary  conditions  suitable  for  incorporation 
into  such  computer  programs  are  presented  which  include  the  effect  of  the  plasma  sheath. 
The  boundary  conditions  are  based  on  an  extension  of  a  fully-kinetic  self-consistent  theo¬ 
retical  description  of  the  plasma  sheath  found  in  Ref.  [1].  In  Section  II,  the  planar  source 
model  is  presented  and  applied  to  a  planar  source  of  half-Maxwellian  particles.  In  Section 
III,  the  plasma-electron,  plasma-ion,  and  surface-emitted-electron  phase-space  distribu¬ 
tion  functions,  and  associated  densities,  fluxes,  normalized  temperatures,  and  energy 
fluxes  are  provided.  In  Section  IV,  the  presheath  and  sheath  potentials  are  evaluated 
numerically.  In  Section  V,  the  boundary  conditions  at  the  presheath/sheath  interface  are 
given  along  with  the  simple  expressions  for  the  presheath  and  sheath  potentials.  Section 
V  is  self-contained  for  easy  referral.  A  concluding  summary  is  found  in  Section  VI. 

II.  Planar  Source  Model 

In  this  section,  the  planar  source  model  is  used  to  describe  the  transport  of  an 
individual  species  of  charged  particles  through  three  types  of  electric  potential  profiles 
under  steady-state,  collisionless  conditions.  In  order  to  avoid  having  to  consider  the  sign 
of  the  charged  particles,  the  theory  is  developed  in  terms  of  potential  energy  profiles. 
The  three  potential  energy  profiles  are  illustrated  in  Fig.  1.  In  all  three  cases,  a  planar 
particle  source  is  located  at  xa  which  injects  particles  with  a  known  velocity  distribution 
into  the  spacial  region  of  interest.  The  spacial  region  of  interest  is  located  between  xa 
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and  xn  (at  the  plasma-facing  surface)  and  particles  which  reach  x  <  x0  or  x  >  xn  are 
considered  lost  from  the  region.  The  injected-particle  velocity  distribution  function  at 
x0  is  written  as  f0(v X)vy,vz)@(vx)  where  the  Heaviside  step  function,  0,  is  included  to 
indicate  that  only  particles  traveling  in  the  positive-x  direction  are  injected  at  x0.  As  a 
result  of  the  conservation  of  energy  and  momentum,  the  initial  velocity  in  the  x  direction 
of  an  injected  particle  at  xa  is  related  to  the  particle’s  velocity  in  the  x  direction,  vx, 
while  the  particle  is  located  at  position,  x,  by 


yox  — 


2[U{x)-U0' 

m 


(1) 


where  U0  =  U(x0 )  is  the  particle’s  potential  energy  at  x0  and  m  is  the  particle  mass.  The 
phase-space  distribution  function  for  the  charged  particles  must  satisfy  the  steady-state 
Vlasov  equation, 


mt), 


g/fev)  _  dU(x)  df(x,v) 


(2) 


dx  dx  dvx 

while  taking  into  account  inaccessible  regions  in  phase-space.  In  terms  of  the  injected- 
particle  velocity  distribution,  the  phase-space  distribution  function  given  by,  f(x ,  vx,  vy,vz) 
fo(vox,vy,vz),  satisfies  the  steady-state  Vlasov  equation  where  vox  is  replaced  by  the 
right-hand-side  of  Eq.  (1). 


For  the  first  potential  energy  profile,  Fig.  la,  the  injected  particles  encounter  an 
asymmetric  potential  well.  In  the  collisionless  limit,  all  injected  particles  pass  directly 
through  the  region  between  xQ  and  xn  and  travel  only  in  the  positive-x  direction.  Since 
all  particles  must  have  velocities  in  the  x  direction  greater  than  y/2  \U~0  -  U(x)]/m,  the 
phase-space  velocity  distribution  function  is 


f  (®>  i  Vy >  l^z)  fo{^oxi  Vy >  ^z)  © 


Vr  ~ 


1 2  [U0  -  U(x)} 


m 


(3) 


For  the  potential  energy  profile  illustrated  in  Fig.  lb,  particles  can  be  reflected 
between  x0  and  xm  and  then  travel  past  x0  moving  in  the  negative-x  direction.  Under 


11-4 


conditions  of  complete  reflection,  Um  — >  oo,  the  injected-particle  and  reflected-particle 
parts  of  the  phase-space  distribution  between  xa  and  xm  are  symmetric.  The  only  dif¬ 
ference  for  a  finite  value  for  Um  is  that  particles  between  xQ  and  xm  with  velocities 
in  the  positive-x  direction  greater  than,  \J2  [f/m  —  U{x)\/m ,  are  not  reflected.  Conse¬ 
quently,  particles  traveling  in  the  negative-x  direction  are  restricted  to  velocities,  greater 
than  —  y/2  [Um  —  U{x)]/m  and  the  phase-space  distribution  function  which  describes  the 
charged  particles  between  xQ  and  xm  is 


f(x,vx,vy,vz)  =  fo(vox,vy,vz)Q  vx  + 


U(x)} 


(x0  <  X  <  xm) 


In  the  region,  xm  <  x  <  xn,  particles  are  accelerated  in  the  positive-x  direction  and  are 
restricted  to  velocities  greater  than  >/2  [£/m  —  U{x)\Jrn .  Consequently,  the  phase-space 
distribution  function  in  this  region  is 


f{x,Vx,Vy,Vz)  =  fo(Vox,Vy,Vz)Q  V. 


(xm  <  X  <  xn)  (5) 


The  third  potential  energy  profile,  Fig.  lc,  causes  the  particle  velocity  distribution 
at  any  location  between  xa  and  xr  to  occupy  two  separate  velocity  regions.  One  region 
describes  particles  moving  in  the  positive-x  direction.  These  particles  have  a  velocity  in 
the  x  direction  greater  than  yJ2  [UQ  —  U(x)]/m  and  are  associated  with  the  phase-space 
velocity  distribution  function, 


f(x,vx,vy,vz)  =  fo(vox,vy,vz)Q  vx- 


l2[U0-U(x)\ 


(x0  <  x  <  xr\  vx  >  0) 


The  second  region  describes  particles  moving  in  the  negative-x  direction.  These  particles 
have  vx  values  between  —  \J2  [Un  —  U(x)\/m  and  —\/2  [U0  —  U (x)}/rn  and  are  associated 
with  the  phase-space  distribution  function, 

. . . ,,j\.  ,  imun-u(x)]\ 


f(x,vx,vy,vz)  =  fo(vox,  Vy,VZ)G  vx  + 
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X©  —  V 


<2  Wo  -  U(x)] 


(xQ  <  x  <  xr ;  vx  <  0) 


Altogether,  the  phase-space  distribution  function  for  particles  between  xQ  and  xr  can  be 
written  as 

f(x,vx,vy,vz)  =  fo(vox,vy,vz)  1-©  (vx  +  )  0  f\/2^0  ~  Vl 


x@  [  vx  + 


l2[Un-U(x)] 


(xD  <  X  <  XT) 


Notice  that  the  velocity  distribution  at  xr  is  the  same  as  at  x0  since  U(xr)  =  U(x0). 
Consequently,  the  phase-space  distribution  function  for  particles  between  xr  and  xn  is 
similar  to  that  given  by  Eq.  (4).  It  is 


f(x,vXtvy,vz)  =  fo(vox,vy,vz)Q  vx  + 


>2[Un-U(x)] 


(Xr  <  X  <  Xn)  (9) 


For  the  three  potential  energy  profiles  shown  in  Fig.  1,  the  planar  particle  source 
located  at  xa  is  now  considered  to  inject  particles  having  a  half-Maxwellian  velocity 
distribution.  The  half-Maxwellian  velocity  distribution  is 


/o(v)0(vx)  =n0  (-)  e~pv  0(vx) 

\7T  J 


where  j3  =  m/{2  T0 ),  and  n0  and  T0  are  the  density  and  temperature  (the  latter  in  energy 
units)  associated  with  the  full-Maxwellian  velocity  distribution,  /0(v).  For  the  potential 
energy  profile  shown  in  Fig.  la,  the  phase-space  distribution  function  is  given  by  Eq. 
(3).  For  half-Maxwellian  injected  particles,  the  phase-space  distribution  is 


/a\ 3/2 

f(x,  v)  =  n0  (  —  J  e-0v  +ipox  © 
\7T  J 


where  i/jox  =  [U0  —  U (x)}/T0.  The  particle  density,  particle  flux,  normalized  temperature, 
and  energy  flux  are 

n  =  J  fd3v  =  ^n0G1(ip0X)  (12) 
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T  2 P  \j  v2  fd3v  / /  ux/d3u\2l  _  ,  2  -y/^oxA  2 

T0~~Z  [  f  fd3v  [  J  f  d3v  )\~  3Gi(^ox)  3tt  [Gj 


(V'ox)]' 


1  [  2  x  j3  (2  +  V’ox)  n0  T0 

=  -mjvvtfdv=  2^— 


"’  2  J  XJ  2V?^ 

Here,  Gi(a;)  =  e1  erfc  (\/x")>  where  erfc  is  the  complementary  error  function. 


For  the  potential  energy  profile  shown  in  Fig.  lb,  the  phase-space  distribution 
function  is  given  by  Eqs.  (4)  and  (5).  With  half-Maxwellian  particles  injected  at  x0,  the 
distribution  function,  density,  flux,  normalized  temperature,  and  energy  flux  are 


r/  \  ( ^  —0v2—i b-m  r\  (  i  / V'ma 

/(z,v)  =  n0  I  -  1  e  p  pxo@\vx  +  J  — 


P\  '  e-pv2-ipxo  q( 


(x0  <  X  <  xm ) 


(*^7Tl  ^  X  Xji j 


TL  —  ^  rio  C  ^>mo  (j2 (ipmx)  {?o  <  X  Xm ) 

=  —  n0  e  ^mo  G\{^mx)  m  <  x  xn ) 


T  _  ,  2  v'V'mx/vr  2  (  ^  ^  \ 

a^Mw)]2  (I°  *  Xm) 

=  l  +  2,^T4-0  ,  *  ,l2  (xm<I<x„)  (19) 

3Gi(?/)mx)  3  7T  [Gi^roa;)] 

and 

^ _  (2  +  'iprnx)  no  Tp  e  ^’mo  (20) 

2  a/tt/5 

where  V>xo  =  [^0*0  -  ^o]/T0,  i^mx  =  [£4n  -  ^(^)]/T0,  'ipmo  =  [b^m  -  ^o]/T0,  and  G2(a:)  = 
ex  erfc  (—,/x). 
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With  half-Maxwellian  particles  injected  at  xa  into  the  potential  energy  profile  shown 
in  Fig.  lc,  the  phase-space  distribution  function  is  given  by  Eqs.  (8)  and  (9).  Since  the 
velocity  distribution  at  any  location  between  x0  and  xr  occupies  two  separate  velocity 
regions,  a  separate  temperature  can  be  associated  with  each.  In  order  to  evaluate  the 
normalized  temperature,  the  phase-space  distribution  function  given  by  Eq.  (6),  which 
describes  particles  moving  in  the  positive-x  direction,  is  used.  The  density,  flux,  and 
energy  flux  are  found  using  the  phase-space  distribution  function  given  by  Eqs.  (8)  and 
(9).  The  relations  are 


3/2 


ft  X  (P  V  - 

/(z,v)  =  n0  -  1  e 

\7T/ 


■(3v2+'tp0 


i -©K  +  */%  )  e(J% 


P 


P 


X©  (  vx  + 


‘A 


p 


( Xa  <  X  <  xr ) 


and 


Tl  =  Tin 


GiftM  -  \  e-*"°  G,Wn.) 


(xr  <  X  <  xn) 
(xa  <  X  <  xr ) 


=  2n°e  l/,no  G2(Ax)  {xr  <  X  <  xn) 


F  = 


nQe 


2  \/tt  (5 


T_  _  J  2  y/AA^_ _ 2_ 


To  3Gi(,0OI)  3  7r  [Gi('0oi)]2 

_  j  2  \J‘ Tpnx/ 7T  2 


(xa  <  x  <  xr;  vx  >  0) 


3G2{Ax)  3  7T  [G2(4i)]‘ 


(xr  <  X  <  xn) 


Q  = 


(2  +  Ax)  no  Tq  e 

2  ]3 


(21) 


(22) 

(23) 


(24) 


(25) 


III.  Presheath  and  Sheath  Description 


11-8 


Illustrations  of  three  possible  electric  potential  profiles  within  a  presheath  and  sheath 
are  shown  in  Fig.  2  and  are  given  in  order  of  increasing  fluxes  of  emitted  electrons  from 
the  plasma-facing  surface.  The  first  profile,  Fig.  2a,  is  a  monotonically  decreasing  poten¬ 
tial  which  occurs  if,  for  example,  the  wall  surface  emits  no  electrons.  The  second  and  third 
profiles,  Figs.  2b  and  2c,  occur  in  space-charge  saturated  sheaths.  Notice  that  with  a 
sufficient  flux  of  emitted  electrons,  the  electrically-floating  wall  gains  a  positive  potential 
with  respect  to  the  plasma.  This  phenomenon,  which  is  illustrated  in  Fig.  2c,  has  been 
experimentally  observed.  [2]  For  all  three  profiles,  half-Maxwellian  plasma  electrons  and 
ions  are  injected  at  xp  while  half-Maxwellian  surface-emitted  secondary  and  thermionic 
electrons  are  injected  at  xs.  The  relations  describing  plasma  electrons,  plasma  ions,  and 
surface-emitted  electrons  within  the  presheath  and  sheath  are  derived  following  a  pro¬ 
cedure  similar  to  that  described  in  Section  II  and  given  below.  The  notation  used  is  as 
follows.  Location  subscripts:  p  -  edge-plasma/presheath  interface;  b  -  presheath/sheath 
interface;  m  -  electric  potential  minimum;  r  -  the  location  in  Fig.  2c  where  the  electric 
potential  equals  that  at  xp  (xr  is  not  shown  in  Fig.  2);  s  -  sheath/surface  interface; 
and  x  -  location  along  the  x  coordinate.  Two  adjacent  location  subscripts  means  the 
difference  in  values  at  the  two  locations.  For  example,  (f)rns  =  4>m  —  (j>s-  Particle  sub¬ 
scripts:  e  -  plasma  electrons;  i  -  plasma  ions;  and  6  -  surface-emitted  electrons.  Symbols: 
(f>  -  the  electric  potential;  e  -  the  unit  charge;  and  Z  -  the  ion  charge  state.  Defini¬ 
tions:  pe  =  me/(2TPe);  Pi  =  mi/(2Tpi);  fy  =  me/(2TsS)]  ipe  =  Ue/Tpe  =  -e0/Tpe; 
^ i  =  Ui/Tpi  =  Ze(f)/Tpi ;  and  V>«5  =  U6/Ts6  =  -ecf)/Tss. 


Plasma  electrons  within  the  potential  profile  shown  in  Fig.  2a  are  described  by: 

3/2 


fe{x,v)  =  npe  ( —  ) 

\  7T  J 


e~PeV  'Ipxpe  Q[Vx  + 


,Jips 


Pe 


ne  =  ^npee  ^spe  G2{tpSxe) 
_  npe  e~^ 

*e~  2V^PI 


(26) 

(27) 

(28) 
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(29) 


=  1  - 


2  '\/'0sa:e/7r  2 

3  G2{lpsxe)  3  7T  [G2(lpSxe)Y 


<1  —  (2  +  j^sxe)  Kpe  ^pe  6  ^3pe  .  . 

2V^T  (30) 

Plasma  electrons  within  the  potential  profiles  shown  in  Figs.  2b  and  2c  are  described 
by: 


fe{x,v)  =  nf 


,-/3ev'2— 4>, 


xpe  0  V,  + 


Tmxe 


(xp  <  X  <  xm) 


n„R  (— ^  e  0ev2  ^xpe  Qlvr  -i  — 


(xm  <  x  <  xs)  (31) 


ne  2  ^ pe  ®  1^mPe  G2{fpmxe)  {Xp  <  X  <C  ^m) 


2  ^Pe  ®  ^’mpe  Gi(lpmx e)  (^-m  ^  X  <  Xs) 

F  _  npe  e~^e 
2\/7r/3e 

2  \/V,mxe/7r  2 

TPe  “  3G2(Vw)  "  3 7T  [G2(i/»mxe)]2  ^  <  *  <  Xmj 

..  2  y/^pmxej^  2  . 


=  1  + 


3Gi(^  771  xe  )  3tt  [Gi(i/;m2:e)]: 


(xm  <  x  <  xa) 


^  _  (2  +  ^mie)  Upe  Tpe  e  ^mpe 

^e_  2V^T  (c 

Plasma  ions  within  the  potential  profiles  shown  in  Figs.  2a  and  2b  are  described  by: 


/i(*.  v)  =  npj  (£) 3/  e  (t>x  -  J 


77 j  —  2  77pi  G\{^l)pXi) 
p  _  npi 

1  2V^T 
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(39) 


T{  _  i  2  y^pxi/^  _ 2 _ 

Tm  SGifypxi)  37 r  [Gi^pxi)]2 


^-v  _  (2  j~  V^pxi)  tipi  Tpi 

Qi~  2^ 

Plasma  ions  within  the  potential  profile  shown  in  Fig.  2c  are  described  by: 


fi  (®  j  v)  =  Tipi 


e-PiV*+%l>pxi  1  _  ©  [  „  + 


X©  I  Vx  + 


(xp  <  X  <  xr ) 


e-Piv2—4>xpi  0  + 


(xr  <  X  <  xs ) 


Tli  —  Tipi  G\ (ippxi)  2  ®  ^’spt  G\{rj}Sxi)  (%p  X  <  xr) 

=  2  ®  (?2('0sxi)  (®r  <  %  (^s) 

npi£^_ 

2y/rr  fa 

T<  l  ,  2\/Vlpii  /* _ 2  (x  <  X  <  X  '  V  >  0) 

T^-'+SG.Ww)  3*  [GlMv*i)]2  ” 


2  \/ Tpsxi/'K  _ 2 _ 

3G!2('0sxi)  37T  [C?2(^sxi)]" 


(xr  <  X  <  Xs ) 


Q  _  (2  +  t/>sxz)  ^pi  Tpi  e  ^3pi 

2 

For  the  description  of  surface-emitted  electrons,  particle  and  energy  fluxes  are  defined 
as  positive  in  the  negative-x  direction.  Surface-emitted  electrons  within  the  potential 
profile  shown  in  Fig.  2a  are  described  by: 

/«(*,  v)  =  naS  (ff  e  (»,  -  )  (46) 
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(47) 

(48) 


ft  6  —  2  ftsS  Gi{ipSxs) 

p  _  ^ 

6  20F& 

Ts=1  _ 2 

TS6  3 Gi(ipsxs)  3tt  [Gi(ipsxs)]2 


(49) 


and 


_  (2  ~ |-  l/lsxs)  fts6  Tsg 

Q‘-  urn 

Surface-emitted  electrons  within  the  potential  profiles  shown  in  Figs, 
described  by: 


(50) 

2b  and  2c  are 


fs(x,  v)  =  nsS  e  p6y2  ^x‘s  ©  -  y  j  (xp  <  x  <  xm) 


and 


—  fts6 


ft 


7T 


^  '  e-^v2-^6  e(vx  +  )j 


'firnxS 


(xm  <  X  <  xs)  (51) 


ft6  =  7}  ftsS  e  'Pms6  Gli'lpmxs)  (xp  <  X  <  Xm) 

—  2  fts6  &  ^m,s  )  ( Xm  <'»  X  <  Xs ) 


FS  = 


^s6  ^ 


~'4)msS 


2\/7T  p6 


T6  2  2 

/-»-!  ^  I  4-X  /  I  \  _ 


Ts6  3Gi(lpmxs)  3-n  [Gi(lpmxs)Y 

_  1  _  2\/^  mx  s_h_ _ 2 


(Xp  X  Xm) 


3G2(lf>mx6 )  37T  [(^(V'mxtf)]' 


(xm  ^  3)  Xg) 


(2  -f~  ’^rnxs)  fts6^ 6 

2  Vines' 


(52) 

(53) 


(54) 


(55) 


Secondary  electron  emission  occurs  when  an  electron  strikes  a  material  surface  and 
causes  an  ejection  of  one  or  more  electrons  from  the  surface.  This  emission  process 
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is  characterized  by  the  electron  emission  coefficient,  <5,  which  equals  the  average  ratio 
of  electrons  emitted  from  the  surface  to  electrons  incident  on  the  surface.  A  relation 
for  secondary  electron  emission  suitable  for  a  plasma-facing  surface  when  space-charge 
saturation  is  not  present  within  the  sheath  is  [3] 


S(Tpe)  = 


roo  e7/6  (2  +  3e)  e_e/,r 
3  (1  +  e)2 


where  r  =  0.72Tpe/Emax,  and  8max  and  Emax  are  material-dependent  constants.  (Values 
for  Smax  and  Emax  are  found,  for  example,  in  Ref.  [4].)  When  space-charge  saturation 


is  present  within  the  sheath, 


2-6  6rnax  f°°  1/6  e/r  f  3  +  4  y/r  Vw  6  _  3  +  4el  ^ 


'nr3  Giilpmse)  Jrrl>„ 


(1  +  y/rTp 

mse  €  (1 + ^ 


provides  a  suitable  relation  for  secondary  electron  emission.  [3] 


The  gross  flux  of  emitted  electrons  at  the  material  surface,  Fgs,  is  given  by  Eq.  (48). 
Consequently,  the  source  density  of  surface-emitted  electrons  is 

nss  =  2  y/irPs  Fgs  (58) 

For  secondary  electron  emission, 

FgS  =  6Fe  (59) 

where  Fe  is  the  net  flux  of  plasma  electrons  (which  is  the  same  at  any  location  with  the 
presheath  and  sheath)  striking  the  surface.  Note  that  the  return  of  secondary  electrons 
to  the  surface  within  a  space-charge  saturated  sheath  is  not  considered  to  result  in 
additional  secondary  electron  emission.  Only  plasma  electrons  (which  normally  will 
have  much  larger  energies)  which  strike  the  surface  are  considered  to  produce  secondary 
electrons.  For  thermionic  electron  emission,  the  gross  electron  flux  leaving  the  surface  is 
given  by  the  Richardson-Dushman  equation, 

F96=AT52{e-^  (60) 
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where  $  is  the  work  function  of  the  plasma- facing  surface  at  temperature,  Tss,  and  A  is 
a  material-dependent  constant.  (Values  for  A  are  found,  for  example,  in  Ref.  [5].)  In 
order  to  use  the  same  description  for  both  secondary  electron  emission  and  thermionic 
electron  emission,  Eqs.  (58)  and  (59)  can  be  combined  to  give, 

ns6  =  2  \/ti  P6  <5  Fe 

where,  for  thermionic  electrons, 

AT?se~*/T- 

F 5 

should  be  used  while  for  secondary  electrons,  Eqs.  (56)  and  (57)  are  suitable. 

IV.  Evaluation  of  the  Sheath  and  Presheath  Potentials 

A  number  of  conditions  are  implemented  in  order  to  evaluate  the  sheath  and  presheath 
potentials.  The  first  two  conditions  are  that  the  total  current  density  to  the  electrically- 
floating  surface  is  zero  (for  a  steady-state  solution)  and  that  the  charge  density  at  the 
presheath/sheath  interface  is  zero.  The  third  condition  is  that  the  spacial  integral  of 
the  charge  density  in  the  presheath  is  zero.  This  requires  the  presheath  to  be  globally 
quasineutral.  It  should  be  noted,  however,  that  a  manifestation  of  the  presheath  model 
used  here  is  that  the  presheath  is  not  locally  quasineutral  except  at  the  presheath/sheath 
interface.  The  fourth  condition,  which  applies  only  when  space-charge  saturation  takes 
place  within  the  sheath,  is  that  the  electric  field  at  the  electric  potential  minimum  is 
zero. 


(62) 


(61) 


The  first  condition,  zero  current  density,  requires  Z  Fi  —  Fe  —  F^.  With  Eqs.  (28), 
(33),  (38),  (43),  (48),  (53),  and  (61),  this  condition  provides  the  relations, 
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Of  =  e  ^mpe  (l  -6e  ^m35)  J— 


a  =  gV'spt— V'mpe  (;[  _  <5e  (65) 

which  correspond  to  the  potential  profiles  shown  in  Figs.  2a,  2b,  and  2c,  respectively. 
Here,  a  =  Znpi/npe  is  a  parameter  called  the  neutralization  factor,  T]  =  rrii/me,  and 
Ti  =  Tpi/Tpe. 

In  order  to  implement  the  second  and  third  conditions,  the  charge  density,  p  — 
e(Zrii-ne-  ns),  between  xp  and  xb  is  needed.  The  charge  density  can  be  written  as  a 
set  of  three  relations  in  terms  of  a  corresponding  to  the  three  potential  profiles  using  Eqs. 
(27),  (28),  (32),  (33),  (37),  (42),  (47),  (52),  and  (61).  Substituting  for  a  the  right-hand 
sides  from  Eqs.  (63-65)  provides, 

=  (!-{)  ,/T G i(Wi)  -  G2«w)  -  ~7=  GiWws)  (66) 


2pe*"-  =  (l-6e-*™‘)  G,M>p.i)  -  G2«w)  -  Gi^mxe)  (67) 

enpe  v  V  Ti  y/T6 


(1-6  e ~'pmsS )  4  —  [2  e^spi  Gx  (t l)pxi)  -  Gx  (i>Sxi)] 


- G2(i>mxe )  -  —  e~^°*  Gi(tpmxs)  (68) 

Vr<5 

where  rs  =  Tss/Tpe.  The  second  condition,  zero  charge  density  at  the  presheath/sheath 
interface,  requires  the  right-hand-sides  of  the  above  relations  to  equal  zero  at  x b .  This 


written  as 


(1  —  6)  J G\(lppbi)  =  G^i'tpsbe)  +  G\(lpsb6) 

(1-6  e_V,ms5)  ^GMpbi)  =  G2(lpmbe)  +  “7=  e~^msS  G^lpmbs) 

V  V  Ti  V  T& 
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and 


(l  -  be  ^ms6) 


[2e*'-«  GxirM  -  Gi(fe)]  =  G2(Vw)  +  -7=  e 
T*  VT<5 


ms6  G^lpmbs) 

(71) 


The  third  condition,  that  the  spacial  integral  of  the  charge  density  in  the  presheath 
is  zero,  requires  zero  electric  fields  at  xp  and  xb.  With  no  electric  fields  at  the  edge- 
plasma/presheath  interface  and  at  the  presheath/sheath  interface,  it  can  be  shown  (see, 
for  example,  Eq.  (36)  of  Ref.  [1])  that  pd<p  =  0  or,  equivalently,  pdipxe  =  0. 
Consequently,  with  Eqs.  (66-68),  the  third  condition  requires 


(1-6Ulr-  M 
T%  JTPP 


rlpbe 


ri’be  Plpbe 

G\  {^pxi)  xe  —  /  G2(tP  sxe  )#  xe  “1“  / —  /  ^1  {^Psxs)  d'tp  xe  (72) 

dlppe  V  ^  d  Tppe 


(l  —  be  ^ms6) 


V'fc. 


r*  Ji>P 


y/n  j^pe 

G\  {ippxi )  dip  xe 

r'tpbe 
f  V'pe 


rwbe  ft  rWbe 

I  G2('4)mxe)  d'lfcxe  H  — :  6  ^ 7X138  I  G\  {^mxs)  d'lpxe 

Jlppe  yT6  JiPpe 

r^bc  r'ipbe 

2 e*”*  Gi(ippxi)dipxe  -  /  Gi(ipaxi) dipa 

dlPnp  d  ibnp 


(1  -  be  ’/'m,s)  A  — 


r^be  Cj  rrl)be 

=  /  G2(lpmxe)  dlpxe  +  /  G 1  [ipmxS )  dip 

dippe  VT6  d  1 Ppe 


(73) 


(74) 


Since  the  electric  field  is  zero  at  the  presheath/sheath  interface,  the  fourth  condition, 
which  requires  the  electric  field  at  the  electric  potential  minimum  to  be  zero,  is  equivalent 
to  requiring  the  spacial  integral  of  the  charge  density  between  the  presheath/sheath 
interface  and  the  potential  minimum  to  be  zero.  Hence,  the  fourth  condition  is  similar 
to  the  third  except  that  it  applies  only  to  the  profiles  shown  in  Figs.  2b  and  2c  and  that 
it  applies  between  xb  and  xrn.  Using  Eqs.  (67)  and  (68)  for  the  charge  density  between 
xb  and  xm,  the  fourth  condition  requires, 

I  rp  P'tpme 

(l  —  6e~^m,s)  A—  /  Gi(ippXi)  dipxe 

V  T»  Jlpbe 
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Plpme  P^Pme 

=  /  G2{'4)mxe)  d'lfcxe -\  -=.  e  ^ 1719 6  I  G\  {^rnxS)  d/^xe 

J^be  VT<5  Jlpbe 

npr  ptpme  P'tpme 

(l  -  6 e~^maS)  J —  2 e^spi  /  Gi('ippxi)d'ipxe-  Gi(V’sii)#xe 

V  Ti  L  Jlpbe  J-tpbe  j 

P'ipme  P^me 

—  I  G2  i^mxe)  d'lftxe  H  -=.£  ^ms<5  I  Gl  {'ifcmxs)  ^0xe 

•Ah.  \/^  A'Oe 

The  integrals  in  Eqs.  (72-76)  are  given  by  the  two  indefinite  integrals, 

J"  G\ {x)  dx  —  G\  ( x )  -f-  2  ^ — 


J  G2(x)  dx  =  G2(x)  -  2  ^ j 


along  with  ipi  =  —Zipe/ri  and  ^<5  =  V’e /'Ts  -  The  electric  potential  at  one  location  must 
also  be  defined.  Hereafter,  the  electric  potential  at  the  presheath/sheath  interface  shall 
be  defined  as  zero,  ipbe  =  0.  With  i/ibe  =  0,  ipse  gives  the  normalized  sheath  potential  drop 
(as  a  positive  value  for  a  monotonically  decreasing  potential)  and  ippe  gives  the  normalized 
presheath  potential  drop  (as  a  negative  value).  For  a  monotonically  decreasing  potential 
(Fig.  2a),  Eqs.  (69)  and  (72)  are  solved  simultaneously  for  the  two  unknown  parameters, 
ippe  and  ipse-  For  a  space-charge  saturated  sheath,  the  three  unknown  parameters,  i ppe, 
ip  me  and  ipse,  are  solved  simultaneously  using  Eqs.  (70),  (73),  and  (75)  if  the  surface  has 
a  negative  floating  potential  with  respect  to  the  edge  plasma  and  Eqs.  (71),  (74),  and 
(76)  if  the  surface  has  a  positive  floating  potential.  In  order  to  determine  the  transition 
from  the  potential  profile  in  Fig.  2a  to  that  in  Fig.  2b  (this  transition  occurs  at  the 
onset  of  space-charge  saturation),  Eqs.  (70),  (73),  and  (75)  are  solved  simultaneously 
for  6,  'ippe  and  'ipse  under  the  condition,  ipme  =  ipse-  In  order  to  determine  the  transition 
from  the  potential  profile  in  Fig.  2b  to  that  in  Fig.  2c,  Eqs.  (70),  (73),  and  (75)  are 
solved  simultaneously  for  6,  ippe  and  'ipme  under  the  condition,  ipse  =  ippe- 
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Once  the  values  for  the  normalized  potentials  are  known,  the  profile  of  the  electric 
potential  within  the  presheath  and  sheath  is  evaluated  using  Poisson’s  equation.  The 
procedure  is  outlined  in  Ref.  [1].  Eq.  (39)  of  Ref.  [1]  is  written  here  as 


x 

XD(p ) 


2  PMA 

6  Tlpe 


1  -1/2 


dip 


(Xp  X  Xjji) 


(79) 


where  xv  =  0,  \D{P)  is  the  Debye  length  in  the  edge  plasma,  and  2  p(ip'J.e)  /  (e  npe)  is  given 
by  Eqs.  (66-68).  The  inner  integral  of  this  relation  is  evaluated  in  closed  form  while  the 
outer  is  evaluated  numerically.  The  results  are  then  inverted  to  obtain  t/;xe  as  a  function 
of  x.  Fig.  2  shows  actual  self-consistent  calculations  of  different  electric  potential  profiles. 
(The  vertical  dimensions  are  different  for  each  of  the  three  profiles  shown  in  Fig.  2.  The 
horizontal  dimensions  are  the  same,  however,  and  the  width  of  the  curve  in  Fig.  2a  is 
27  A£)(p).)  The  profiles  are  calculated  for  a  thermal  (t*  =  1)  hydrogen  (Z  =  1,7]  =  1836) 
plasma  which  is  bounded  by  an  electron-emitting  surface  (with  r6  =  0.2).  The  profiles 
are  shown  in  order  of  increasing  values  for  the  electron  emission  coefficient:  6  =  0  for 
Fig.  2a,  6  =  15  for  Fig.  2b,  and  6  =  1000  for  Fig.  2c.  The  values  for  the  normalized 
potentials  calculated  for  each  of  the  profiles  are:  ippe  =  —0.34  and  ipse  =  2.5  for  Fig.  2a; 
4>Pe  =  -0.43,  ipme  =  0.29  and  ipse  =  -0.28  for  Fig.  2b;  and,  ippe  —  -0.53,  ipme  =  0.31 
and  ipse  —  —1.1  for  Fig.  2c.  The  onset  of  space-charge  saturation  occurs  at  6  =  0.88, 
V’pe  =  —0.43,  and  xpme  =  ipse  —  0.29  while  the  transition  from  a  negative  to  a  positive 
surface  floating  potential  occurs  at  6  -  31,  ippe  =  ipse  -  -0.43,  and  xpme  =  0.29. 


V.  Boundary  Conditions 


The  boundary  conditions  provided  in  this  section  are  those  which  occur  at  the 
presheath/sheath  interface.  This  location  is  both  quasineutral  and  electric-field  free. 
For  brevity,  location  subscripts  are  not  used  in  this  section  since  quantities  which  were 
previously  location  dependent  are  now  only  considered  at  the  presheath/sheath  interface. 
The  boundary  conditions  at  the  presheath/sheath  interface  are  the  pressure  terms,  (vp), 
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( v\ )  and  (vl)  —  (vx)2,  the  plasma  flow  velocity,  (vx),  and  the  energy  flow  term,  (v2  vx). 
Using  relations  from  Section  III,  the  boundary  conditions  are  written  in  terms  of  the 
electron  and  ion  plasma  temperatures  (Te  and  Ti)  at  the  presheath/sheath  interface,  the 
temperature  (Tg)  associated  with  electrons  emitted  at  the  plasma-facing  surface,  the  ratio 
(xj)  of  ion  to  electron  plasma  temperatures  at  the  edge  plasma,  the  normalized  presheath 
(ippe),  sheath  {'tpae),  and  minimum  (ipme)  potentials,  the  electron  and  ion  masses  (rae 
and  rrii)  and  the  ion  charge  state  (Z).  Using  relations  in  Section  IV,  the  normalized 
potentials  are  evaluated  numerically  and  the  values  obtained  are  fit  in  terms  of  Z,  Ti, 
the  electron  emission  coefficient  (6),  which  equals  the  ratio  of  surface-emitted  electron 
flux  to  plasma  electron  flux  incident  on  the  surface,  the  ratio  (77)  of  ion  to  electron  mass, 
and  the  ratio  (Tg)  of  the  temperature  associated  with  electrons  emitted  at  the  surface 
to  the  electron  plasma  temperature  at  the  edge  plasma.  Of  the  three  types  of  electric 
potential  profiles  shown  in  Fig.  2,  the  third  is  expected  to  be  rare  since,  for  Tg  <  0.1,  the 
electron  emission  coefficient  must  be  very  large,  6  >  103,  to  produce  the  type  of  potential 
profile  shown  in  Fig.  2c  (with  Tj  ~  1).  For  this  reason,  only  boundary  conditions  for  the 
electric  potential  profiles  shown  in  Figs.  2a  and  2b  are  presented.  Also,  all  calculations 
of  normalized  potentials  are  carried  out  using  the  same  Tg  value,  Tg  =  0.01.  This  is 
appropriate  since,  for  Tg  <  0.1,  the  normalized  potentials  are  essentially  independent  of 
Tg. 


For  use  of  the  boundary  conditions,  Ti  is  the  only  non-local  parameter  needed.  The 
value  for  r,  should  be  evaluated  at  one  of  two  possible  locations  depending  on  which  of 
the  two  has  a  shorter  path  length  to  the  wall  surface  (the  path  length  should  be  along  a 
magnetic  field  line  if  a  magnetic  field  is  present).  One  location  is  an  ion  mean-free-path 
away  from  the  wall  surface  while  the  other  location  is  where  the  maximum  temperature 
occurs  along  the  magnetic  field  line.  The  first  of  these  two  locations  applies  when  no 
magnetic  field  is  present  or  when  the  source  of  presheath  ions  is  predominantly  either 
ionization  or  diffusion  parallel  to  a  magnetic  field;  the  second  method  applies  when 
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diffusion  perpendicular  to  a  magnetic  field  provides  most  of  the  presheath  ions. 


Two  of  the  boundary  conditions,  the  second  and  third  pressure  terms,  can  be  written 
in  terms  of  another  boundary  condition,  the  first  pressure  term.  They  are  ( v\ )  =  (y2) 
and 

(vl)~(vx)2  =  2(vl)  (80) 

The  other  boundary  conditions  are  determined  separately  for  plasma  electrons,  plasma 
ions,  and  surface-emitted  electrons.  (For  surface-emitted  electrons,  ( vx)6  and  (y2  vx)6 
are  defined  as  positive  in  the  negative-x  direction.)  For  the  electric  potential  profile 
shown  in  Fig.  2a,  the  boundary  conditions  are 


/„.2\  _ 

^  6  meXse 

(81) 

\/2 

x  e  erfc(—\/ ipae  ) 

(82) 

(v2  vx)e  =  2  (2  +  1pse)  (v2y)e  (vx)e 

(83) 

/  nm2\  _ 

^Vy  *  ~  rriiXpi 

(84) 

'  x"  erfc(v/'0p,  ) 

(85) 

(v2  vx)i  =2(2  +  ipPi)  (' v2y ).  {vx)i 

(86) 

«>*  “  £ 

(87) 

_ 

x  6  erfc(v%6  ) 

(88) 

(u2  ux)5  =  2  (2  +  V>»fi)  (vj)*  (vx)s 

(89) 
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where  the  following  definitions  are  used: 


Xse  —  1 


2\M,e/7r _ 2 _ 

3e^*e  erfc(— y/ipse  )  37r  [e^se  eric(—y/'ipse  )]2 


In  (2.51 


Xpi~1+  3e*^U/^~)  3jr[eV’p.erfc(V^')]2C3ln(2'05,i’'’iOO13)  (91) 

ippi  =  -Zippe/ri,  and  ipss  =  Teipae/(T6Xse)-  The  approximations  for  xse  and  xPi 
are  accurate  to  within  2%  for  0.1  <  ipse  <  10  and  0.01  <  tppi  <  100,  respectively. 
For  the  normalized  sheath  and  presheath  potentials,  the  following  two  expressions  are 


recommended: 


ipse  =  In  [0.2725  (1  —  6)  y/rf  r, 


-0.092  *7-0.441 


-In  1.43  ( — 


These  expressions  are  fits  to  numerically-determined  values  using  the  values:  1,10,  and 
100  times  the  proton-to-electron  mass  ratio  (1836)  for  77;  1/3,  1,  and  3  for  1,  2,  and 
3  for  Z;  and,  0  and  1/2  for  8.  The  numerically-determined  values  were  calculated  using 
all  possible  combinations  of  77,  Tj,  Z,  and  8  except  combinations  which  simultaneously 
involved  Z  =  1  and  77  =  1836.  The  expressions  for  ipse  and  V’pe  agree  to  within  4%  and 
7%,  respectively,  with  the  numerically-determined  values.  Notice  that  the  relation  for 
the  sheath  potential  has  the  usual,  ln[(l  -  8)  rf  ],  dependence  given  elsewhere.  [6,7] 


The  onset  of  space-charge  saturation  and  the  corresponding  transition  from  a  mono- 
tonically  decreasing  potential  (e.g.  Fig.  2a)  to  a  potential  profile  with  a  potential  mini¬ 
mum  (e.g.  Fig.  2b)  takes  place  if  the  electron  emission  coefficient  reaches  a  sufficiently 
large  value.  A  suitable  expression  which  provides  the  value  of  the  electron  emission 
coefficient  at  the  transition  between  profiles  is 


11-21 


This  expression  is  a  fit  to  values  determined  numerically  for  the  same  combinations  of 
7],  Ti ,  and  Z  which  were  used  for  the  ipse  and  ippe  fits.  The  Sc  fit  is  remarkably  accurate 
being  within  0.1%  of  the  numerically- determined  values. 


If  the  electron  emission  coefficient,  as  determined  using  Eq.  (56)  for  example,  is 
larger  than  6C,  boundary  conditions  associated  with  the  potential  profile  shown  in  Fig. 
2b  should  be  used.  For  the  potential  profile  shown  in  Fig.  2b,  the  boundary  conditions 
are  the  same  as  those  for  Fig.  2a  except  with  subscript,  s,  replaced  by  subscript,  m.  For 
the  normalized  sheath  minimum  and  presheath  potentials  under  space-charge  saturation 
conditions  (Fig.  2b),  the  following  two  expressions  are  recommended: 


V’me  =  In  (1.63  770  01) 


(95) 


and 


■0pe  —  In 


1.51 


(96) 


These  two  expressions  agree  to  within  5%  and  6%,  respectively,  with  numerically-determined 
values  using  the  combinations  of  7),  Tiy  and  Z  used  for  the  6C  fit  along  with  6  =  Sc  and 
6  =  10.  Note  that  these  expressions  are  independent  of  6  for  8  >  6C.  Thus,  a  specific 
relation  for  the  electron  emission  coefficient  under  conditions  of  space-charge  satura¬ 
tion  [e.g.  Eq.  (57)]  is  not  needed  for  calculating  the  associated  boundary  conditions.  It 
should  also  be  mentioned  that  trapping  of  slow  charge-exchange  ions  within  the  potential 
well  surrounding  the  potential  minimum  has  been  found  to  be  responsible  for  reducing 
space-charge  saturation.  [2]  When  that  is  the  case,  it  is  probable  that  the  sheath  potential 
profile  is  adequately  described  by  using  an  effective  electron  emission  coefficient  which 
is  smaller  than  the  actual  one.  If  so,  then  for  situations  in  which  6  >  6C,  the  boundary 
conditions  presented  here  are  unaffected  by  ion  trapping  since  the  sheath  minimum  and 
presheath  potentials  are  not  dependent  on  6.  Nevertheless,  a  more  detailed  study  of  the 
effects  of  ion  trapping  on  space-charge  saturation  appears  warranted. 


VI.  Concluding  Summary 


11-22 


A  fully-kinetic  self-consistent  theory  describing  the  plasma  sheath  under  conditions 
of  space-charge  saturation  has  been  presented.  The  phase-space  distribution  functions 
for  each  species  of  particles  have  been  evaluated  and  velocity  moments  have  been  taken 
in  order  to  obtain  particle  densities,  fluxes,  temperatures,  and  energy  fluxes.  The  elec¬ 
tric  potential  profile  has  been  determined  self-consistently  for  three  different  types  of 
profiles.  These  are  a  monotonically-decreasing  potential,  a  single-minimum  potential 
associated  with  a  negative  surface  floating  potential,  and  a  single-minimum  potential 
associated  with  a  positive  surface  floating  potential.  The  electron  emission  requirements 
for  production  of  the  third  potential  profile  were  found  to  be  severe.  For  this  reason, 
boundary  conditions  have  been  provided  only  for  the  first  two  of  the  three  potential  pro¬ 
files.  Boundary  conditions  on  velocity  moments  have  been  supplied  for  three  separate 
species:  plasma  electrons,  plasma  ions,  and  surface-emitted  electrons.  These  bound¬ 
ary  conditions  can  be  combined  as  needed  for  use  in  multi-fluid  and  single-fluid  (MHD) 
applications.  The  boundary  conditions  are  in  terms  of  sheath  and  presheath  potential 
drops  and  simple  expressions  have  been  provided  for  these  potential  drops.  Once  the 
potential  drop  expressions  are  inserted  into  the  boundary  conditions,  the  boundary  con¬ 
ditions  are  in  terms  of  the  three  species’  temperatures,  r*,  the  ion-to-electron  plasma 
temperature  ratio,  Z,  the  ion  charge  state,  rj,  the  ion-to-electron  mass  ratio,  and  6,  the 
electron  emission  coefficient. 

In  order  to  determine  the  value  of  S  at  which  the  transition  between  the  first  two 
potential  profiles  occurs,  the  following  expression  was  developed:  6C  =  1  —  7.7 x9,1  \fzjr} . 
This  expression  gives  the  value  of  the  electron  emission  coefficient  at  the  onset  of  space- 
charge  saturation.  For  6  <  6C,  the  following  expressions  are  recommended  for  the  sheath 
and  presheath  potentials  normalized  to  the  electron-plasma  temperature:  ln[0.2725  (1  — 
6)  VrfTf0-092  Z~0,44]  and  ln[1.43  (Z/r*)0'195].  For  6  >  <5C,  recommended  expressions  for 
the  normalized  sheath  minimum  and  presheath  potentials  are,  respectively,  ln(1.63^°'01) 
and  ln[1.51  (Z/Ti)0  209].  The  conditions  these  expressions  for  6C  and  the  normalized 
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potential  are  expected  to  be  suitable  for  are:  a  temperature  associated  with  the  emitted 
electrons  which  is  less  than  one  tenth  that  associated  with  the  plasma  electrons;  1/3  < 
n  <  3;  1  <  Z  <  3;  and  1836  <  7?  <  100  x  1836. 
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Figure  Captions 


Fig.  1.  The  particle  potential  energy  profiles  considered  in  Section  II.  A  planar 
source  of  particles  is  located  at  x0  which  emits  particles  in  the  positive-x  direction.  For 
the  potential  energy  curve  shown  in  (a),  U(xm )  <  U(xn)  <  U(xa)  and  all  particles 
originating  at  xa  travel  past  xn.  For  the  potential  energy  curve  shown  in  (b),  U(x0)  < 
U(xm),  U(xn)  <  U(xm),  and  a  fraction  of  the  particles  originating  at  x0  reach  a  turning 
point  between  xD  and  xm  and  pass  back  through  x0.  For  this  potential  energy  profile, 
U(xn)  <  U(x0)  is  also  possible.  For  the  potential  energy  curve  shown  in  (c),  U(xm)  < 
U(xQ )  <  U(xn),  U(x0)  =  U(xr),  and  a  fraction  of  the  particles  originating  at  xa  reach  a 
turning  point  between  xr  and  xn  and  pass  back  through  xD. 


Fig.  2.  Illustrations  of  possible  electric  potential  profiles  (solid  curves)  in  the 
presheath  and  sheath  for  (a)  no  space-charge  saturation,  (b)  space-charge  saturation 
and  a  surface  floating  potential  which  is  negative  with  respect  to  the  edge  plasma  (at 
x  <  xp),  and  (c)  space-charge  saturation  and  a  positive  surface  floating  potential.  Al¬ 
though  not  apparent  as  a  result  of  the  limited  resolution  of  the  plots,  the  electric  field  is 
zero  at  xp  and  Xb .  The  dotted  lines  are  provided  to  guide  the  eye. 


11-26 


Figure  2 


11-28 


Analysis  of  the  Structure  and  Motion  of  Equatorial  Depletion  Bands 
Using  Optical  All-sky  Images 


R.  Pickett 
Professor 

Department  of  Psychology 
and 

G.  S.  Sales  and  A.  Gee 


University  of  Massachusetts  Lowell 
Lowell,  MA 


Final  Report  for: 

Summer  Research  Extension  Program 
Phillips  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base,  DC 

and 

Phillips  Laboratory 


June  1997 


12-1 


Analysis  of  the  Structure  and  Motion  of  Equatorial  Depletion  Bands 
Using  Optical  All-sky  Images 


R.  Pickett 
Professor 

Department  of  Psychology 
and 

G.  S.  Sales  and  A.  Gee 


Abstract 

This  project  has  been  concerned  with  finding  an  improved  approach  to  analyzing  the  volumetric 
structure  of  equatorial  depletion  bands.  These  north-south  oriented  bands  of  depleted  optical 
emission  (630.0  nm)  appear  in  the  F-region  of  the  ionosphere  in  the  night  sky  in  the  vicinity  of  the 
magnetic  equator.  They  arise  behind  the  sunset  terminator  and  slowly  travel  eastward.  At  any 
specific  site,  two  or  three  of  these  depletions  can  be  observed  at  intervals  during  the  evening.  The 
critical  interest  motivating  the  present  work  is  that  these  regions  of  depleted  emission  in  the 
ionosphere  can  severely  disrupt  line-of-sight  radio  signals  that  pass  through  them  from  navigation 
and  communications  satellites. 
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1.  Introduction 


This  project  has  been  concerned  with  finding  an  improved  approach  to  analyzing 
the  volumetric  structure  of  equatorial  depletion  bands.  These  north-south  oriented  bands 
of  depleted  optical  emission  (630.0  nm)  appear  in  the  F-region  of  the  ionosphere  in  the 
night  sky  in  the  vicinity  of  the  magnetic  equator.  They  arise  behind  the  sunset  terminator 
and  slowly  travel  eastward.  At  any  specific  site,  two  or  three  of  these  depletions  can  be 
observed  at  intervals  during  the  evening.  The  critical  interest  motivating  the  present 
work  is  that  these  regions  of  depleted  emission  in  the  ionosphere  can  severely  disrupt 
line-of-sight  radio  signals  that  pass  through  them  from  navigation  and  communications 
satellites.  To  better  understand,  predict  and  contend  with  these  disruptions,  it  is 
important  to  learn  as  much  as  possible  about  the  volumetric  structure  of  these  regions. 
Information  on  the  volumetric  structure  has  been  obtained  to  date,  primarily  from 
analyses  of  ground-based  radar  soundings.  Volumetric  information  has  also  been  derived 
from  analyses  of  differences  in  the  character  of  line-of-sight  radio  signals  between  those 
that  “look”  through  the  depletion  and  those  that  do  not.  The  use  of  multiple  satellites  in 
different  positions,  potentially  reveals  different  aspects  of  the  depletions  volumetric 
structure.  However,  the  problem  with  both  of  these  current  approaches  is  that  they 
provide  only  crude  and  sparse  data. 

The  present  project  explores  the  possibility  of  deriving  more  fine-grained 
information  from  analyses  of  ground-based  optical  images.  Considerable  amounts  of 
optical  data  are  available,  much  of  it  simultaneous  with  the  radar  sounding  and  signal 
disturbance  observations.  To  date,  however,  analyses  of  the  optical  data  have  been 
aimed  at  mining  only  two  dimensional  information  on  the  depletions,  i.e.,  with  respect  to 
aspects  such  as  depletion  width,  lateral  position  and  motion.  But,  the  light  intensity 
recorded  at  each  pixel  in  these  images  can  be  considered  the  integral  of  the  emission 
along  the  line  of  sight  from  the  pixel  position  in  the  image  plane  through  the  imager 
optics  and  out  through  the  F-layer.  Using  an  appropriate  model  of  the  electron  density  of 
the  F-region  and  computing  the  implied  line-of-sight  emission  intensity,  one  should  be 
able  to  infer  aspects  of  the  volumetric  shape  of  these  depletions  that  account  for  the 
distribution  of  intensity  at  the  image  plane.  Required  is  a  model  to  generate  the  idealized 
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intensity  distribution  that  would  occur  at  the  image  plane  for  particular  settings  of 
altitudinal  and  volumetric  parameters  of  idealized  depletion  regions.  Smoothed  versions 
of  the  empirical  image  data  can  then  be  compared  to  the  idealized  images,  and 
parameters  of  the  model  can  then  be  adjusted  to  maximize  the  fit  of  the  ideal  image  to 
the  empirical  image.  The  required  parameter  settings  of  the  model  can  then  serve  as 
measures  of  the  volumetric  structure  of  the  real  depletion.  Such  was  the  logic  of  the 
present  project,  and  we  pursued  it  in  three  main  steps. 

1.  First,  develop  a  model  for  generating  line-of-sight  emission  and  attendant 
intensities  in  the  image  plane  as  a  function  altitude  and  volumetric  parameters. 

2.  Second,  obtain  smoothed  cross-sections  of  the  sequence  of  images  of  three 
actual  depletion  bands  and  conduct  various  analyses  of  the  position,  shape  and  motion  of 
these  cross-sections  as  a  function  of  time. 

3.  Third,  use  the  model  to  generate  synthetic  images,  taking  cross-sections  of 
model  generated  images  corresponding  to  those  we  obtained  on  actual  images  and 
determined  what  settings  of  the  model  parameters  was  required  to  maximize  goodness- 
of-fit  between  the  actual  and  the  synthetic  profiles. 

In  Section  2  below  we  present  the  relevant  technical  background: 

a.  On  the  general  characteristics  of  equatorial  depletion  bands 

b.  On  the  physics  and  chemistry  of  the  F-region  and  on  the  mechanisms  that 
form  the  depletion  bands; 

c.  On  the  imaging  system  -  its  optical  and  electronic  features  that  must  be 
considered  when  translating  pixel  intensities  at  each  point  in  the  image  to 
actual  emissions. 

In  Section  3,  we  report  on  development  of  the  model.  In  Section  4,  we  report  on  the 
empirical  and  model-based  analyses  that  we  conducted  on  the  three  selected  depletions. 
In  Section  5  we  present  a  summary  of  the  results  and  our  overall  conclusions  and 
recommendations. 
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2.  Technical  Background 


2.1.  General  Characteristics  of  Equatorial  Depletion  Bands 

At  the  magnetic  equator  and  beginning  after  sunset,  a  series  of  relatively  narrow, 

north-south  oriented  bands  of  electron  density  depletion  in  the  F-region  of  the  ionosphere 
appear  at  the  western  horizon  and  drift  slowly  eastward.  At  a  particular  site  two  or  three 
are  bom  at  intervals  during  the  evening  hours.  These  depletions,  typically  100  to  200  km 
wide  are  directly  associated  with  bubbles  of  low  density,  low  altitude  plasma  that  form 
and  rise  at  the  magnetic  equator.  The  origin  of  the  bubbles  in  the  vicinity  of  the  magnetic 
equator  is  described  in  detail  by  Kelly  (The  Earth’s  Ionosphere)  in  terms  of  the  growth  of 
Rayleigh-Taylor  instabilities  in  the  lower  F-region,  i.e.  around  250  km.  The  more  dense 
ionization  is  forced  out  of  the  bubble  volume  as  it  rises  into  the  plasma  region  above  250 
km.  The  bubbles  rise  to  about  750  km  at  the  equator  and  spread  north  and  south  by  more 
that  1000  km,  aligned  with  and  following  the  earth’s  down-heading  magnetic  field  lines. 
Agua  Verde,  Chile,  the  observing  site  for  the  depletions  used  in  this  project,  is  some  1200 
km  south  of  the  magnetic  equator,  and  the  bubbles  have  curved  down  along  the  field 
lines  at  this  point  to  about  300  km  altitude.  The  resulting  elongated  regions  of  depleted 
emission  are  seen  as  dark  bands  when  viewed  from  the  ground  against  the  background  of 
normal  630.0  nm  airglow  of  about  40  to  50  Rayleighs  at  these  latitudes. 

2.2.  Physics  and  Chemistry  of  the  F-Region 

Optical  emissions  are  generated  in  the  high  atmosphere  by  a  variety  of  chemical 
processes.  One  of  the  most  useful  emission  lines  arises  from  excited  atomic  oxygen  at  a 
wavelength  of  630.0  nm  (red  line).  This  emission  tracks  the  density  of  electrons  in  the 
interval  from  250  to  300  km  in  the  F-region  of  the  ionosphere.  During  the  daytime  when 
solar  UV  is  available  to  ionize  atomic  species  at  altitudes  in  the  F-region,  equal  numbers 
of  electrons  and  positively  charged  atomic  oxygen  ions  are  produced.  After  sunset,  with 
no  significant  ionization  sources  present,  two  chemical  processes  act  sequentially  to 
remove  the  electrons  and  the  0+  ions,  resulting  in  a  decay  of  the  lower  F-region  and  the 
emission  of  photons,  specifically  at  630.0  nm. 

The  first  of  these  processes  involves  charge  exchange  between  the  0+  ions  and 
the  neutral  molecular  oxygen  to  form  neutral  atomic  oxygen  and  an  C>2+  ion.  The 
chemical  expression  for  this  reaction  is: 
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0+  +  02  — >  O  +  02  • 

It  proceeds  at  a  laboratory  measured  rate  ki  =  2  x  I  O'1 1  cm6  s‘f.  The  number  of  0+  ions 
lost  in  a  unit  volume  per  second  is  k]-n[0+]-n[02].  A  typical  value  for  this  rate  of 
disappearance  of  0+  ions  in  the  F-region  at  300  km  is  around  200  s*1. 

The  second  process  is  the  dissociative  recombination  reaction  described  by  the 
expression: 

OZ  +  e  ->  O*  +  O* 

where  O*  is  one  of  several  possible  forms  of  excited  atomic  oxygen.  The  rate  constant 
for  this  reaction  is  k2  =  2.2  x  10-7  (300/Te)°-8  cm6  s'1.  At  300  km,  with  an  electron 
temperature  Te  =900  K,  this  rate  is  about  9.13X10"8  cm6  s'1,  considerably  faster  than  the 
charge  exchange  reaction. 

This  means  that  as  soon  as  the  charge  exchange  reaction  occurs,  it  is  immediately 
followed  by  a  recombination  reaction  effectively  removing  the  electron  and  positive  ion 
and,  at  the  same  time,  leaving  behind  two  excited  neutral  atomic  oxygen  atoms  which 
ultimately  radiate  photons,  with  a  high  probability,  at  the  630. Own  red  line.  Actually 
these  excited  atomic  oxygen  atoms  are  in  a  forbidden  state  with  a  relatively  long  half  life 
of  1 10  s,  which  is  important  in  terms  of  the  quenching  process  discussed  below. 

Reasonable  standard  models  for  the  neutral  02  and  N2  density  and  neutral 
temperature  profiles  used  in  our  modeling  effort  are  given  in  Figures  1  and  2. 


34  5  6  78  9  10  11 

Atmospheric  density  (cm-5) 

Figure  1 .  Model  of  the  density  of  the  neutral  constituents  (molecular  oxygen  and 
nitrogen)  the  atmosphere  vs.  altitude. 
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Temperature  (K) 

Figure  2.  Model  of  the  neutral  atmospheric  temperature  vs.  altitude. 

The  vertical  electron  density  profile  for  the  background  ionosphere  (as  opposed  to  inside 
the  depletion  bands)  is  illustrated  in  Figure  3. 


Electron  density  (cm'5) 

Figure  3.  Assumed  background  ionospheric  electron  density  vs.  altitude  for  Agua  Verde, 
Chile  on  Oct.  1, 1994  at  01  UT. 

Another  important  factor  in  calculating  the  background  airglow  emission  is  the  so 
called  "quenching"  process  which  deactivates  the  excited  atomic  oxygen  atoms  by 
collisions  with  other  constituents  of  the  atmosphere  before  the  excited  atom  is  able  to 
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radiate.  The  approach  presented  by  Sheehan  (private  communication)  is  used  here; 
considering  only  collisions  between  the  excited  atomic  oxygen  atoms  and  02,  N2  and 
electrons.  The  resulting  expression  for  the  volumetric  emission  rate  for  630.0  nm 
photons,  taking  into  account  both  the  chemical  excitation  processes  and  the  collisions 
which  prevent  the  radiative  emission  is: 

p_ _ 0,86  k2  n(02)  ne _ 

e  f  k3n(N2)  +  k4n(Q2)  +  k5ne 
6.81  *  103 

where  k3,  kf  and  k5  are  the  collisional  deactivation  rates  for  N2,  02  and  electrons, 
respectively.  Actually  in  the  region  below  300  km,  where  quenching  is  important, 
collisions  with  the  neutral  N2  dominate.  Figure  4  shows  the  calculated  volumetric 
emission  rate  for  the  630.0  nm  airglow  using  the  above  atmospheric  models. 


Figure  4.  Calculated  background  volumetric  emission  rate  at  630.0  nm. 

Under  these  conditions  the  peak  emission  originates  in  the  vicinity  of  285  km  falling  off 
by  a  factor  of  five  in  ±  50km. 

The  volumetric  emission  fall-off  below  the  peak  results  from  the  rapid  quenching 
of  the  excited  oxygen  atoms  by  collision  with  the  dominant  neutral  constituents  of  the 
atmosphere  before  the  O*  can  radiate.  The  decreased  emission  above  300  km  results 
from  the  rapidly  decreasing  density  of  molecular  oxygen,  resulting  in  less  charge 
exchange,  resulting  in  less  excited  atomic  oxygen  and,  therefore,  less  emission. 
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The  integrated  vertical  emission  of  630.0  nm  is  about  42.2  R  at  Agua  Verde, 
which  corresponds  well  with  other  observations  of  mid-latitude  airglow. 

The  all-sky  imager,  in  viewing  a  full  180°  about  the  local  zenith,  integrates  the 
volumetric  emission  along  each  “look”  angle,  and  the  resultant  intensity  is  located  at  a 
unique  position  on  the  “film”  plane,  as  discussed  in  detail  in  Section  2.3. 

2.3.  Ground-Based  Optical  Imaging  System 

The  HAARP  all-sky  scanning  photometer  system  records  images  in  a  256  x  256 
pixel  format,  with  12  bits  of  stored  information  per  pixel,  and  scans  up  to  four 
wavelengths  every  minute.  As  discussed  above,  these  optical  images  result  from 
“airglow”  emissions  that  occur  principally  between  250  km  and  300  km  in  altitude. 

The  imager  uses  a  lens  system  that  has  the  capability  to  "photograph"  the  entire 
sky  from  horizon-to-horizon.  The  emissions  involved  here  are  relatively  weak  and 
require  several  levels  of  amplification  to  achieve  a  recordable  image.  A  CCD  detector  is 
used  with  the  256  x  256  pixels  followed  by  an  intensifier  with  a  variable  gain  control. 
This  system  has  been  calibrated  by  the  developers  of  the  HAARP  imager  system,  KEO 
Associates.  To  convert  the  measured  counts  to  a  photon  emission  rate  measured  in 
Rayleighs  (one  Rayleigh  corresponds  to  a  photon  flux  of  106/cm2/s),  the  calibration 
curves,  which  depend  on  gain  setting  and  exposure  time,  were  taken  from  documentation 
supplied  KEO  Associates  (private  communication),  who  built  the  system. 

For  the  analysis  of  the  depletion  images,  it  is  important  to  establish  the 
geometrical  relationships  between  the  recorded  optical  image  and  the  location  and 
motions  of  the  emitting  sources  in  the  ionosphere.  This  analysis  was  carried  out  assuming 
the  emitting  sources  in  the  F-region  are  at  an  altitude  of  300  km.  Of  the  256  x  256  pixels 
on  the  CCD  image  plane,  pixel  #128  represents  looking  overhead  and  pixels  #1  and  #256 
represent  the  horizon.  The  relationship  is  expressed  as: 

.  fp-  128)  ... 

y\f  =  arcsin  I  12g  (3) 

where  \\f  is  the  look  angle  from  the  site  to  the  source  measured  from  the  vertical,  and  p  is 
the  pixel  number.  The  geometry  is  illustrated  in  Figure  5. 
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Figure  5.  Simplified  all-sky  image  geometry  assuming  the  emitting  layer  is  at  altitude  H. 
To  relate  the  pixel  position  p  to  the  distance  of  the  emitting  source  from  the  site  a  second 

expression  for  the  look  angle,  vj/,  and  the  earth  central  angle,  a,  is  derived  as: 

vt/  =  arctan^c6ss1”°LYj  (4) 
where  y  =  Re/Re+h,  where  h=300  km  in  this  situation. 

Eliminating  the  look  angle,  vj/,  between  these  two  equations  yields  an  expression  relating 
the  earth  central  angle,  a,  to  the  pixel  number.  Finally,  the  distance  of  the  emitting 
source  from  the  site  is  D  =  a  Re .  A  plot  of  D  vs.  p  is  shown  in  Figure  6. 


Pixel 

Figure  6.  Relationship  of  the  ground  distance  of  an  emitting  source  to  pixel  position.  The 
straight  line  fit  illustrates  the  linearity  of  the  image  from  overhead  (pixel  128)  out  almost 
half  way  across  the  image  (pixel  60). 
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As  an  example,  an  object  on  the  horizon  at  an  altitude  of  300  km  lies  some  1700  km  from 
the  site.  The  relationship  between  distance  and  pixel  number  is  quite  linear  from  pixel 
#128  (the  overhead  position)  out  to  ±  68  pixels. 

Finally,  it  is  important  to  know  the  velocity  with  which  a  source  moves  over  the 
image,  assuming  a  constant  angular  velocity  of  the  actual  source  measured  at  the  center 
of  the  earth.  This  can  be  applied  to  either  depletions,  which  typically  move  from  west  to 
east,  or  to  stars  in  the  background,  which  move  from  east  to  west  as  the  earth  rotates. 
Here  the  expression  for  the  apparent  velocity  is: 

===f  s(a)  (6) 

+  1 

where,  s(a)  =  ^-vj/(a)  . 


v«(a)  =  v 


60  xD 


1000  *  (Re  +  H) 


sin  (a) 
cos  (a)  -  y 


For  a  typical  depletion  band  at  an  altitude  of  300  km  moving  at  a  speed  of  80  m/s  the 
apparent  velocity  across  the  image  is  shown  in  Figure  7. 
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Figure  7.  The  apparent  source  velocity  (pixels/min)  vs.  pixel  position  for  a  source  at  300 
km  moving  with  a  constant  velocity  of  80  m/s  in  the  west  to  east  direction. 

At  the  center  of  the  image,  the  depletion  moves  at  the  maximum  speed  of  about  2 
pixels/minute.  The  apparent  speed  is  less  as  the  source  moves  closer  to  the  edges  of  the 
image,  i.e.,  nearer  to  the  horizon. 
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3.  Model  of  Electron  Density  Depletions 

We  designed  this  model  to  represent  the  electron  density  distribution  within  the 

depletion  region  in  terms  of  the  altitude  z  in  kilometers,  and  longitude  L  in  degrees, 
measured  from  the  longitude  of  the  center  of  the  depletion  L0 .  It  is  represented  as: 

ne(z,L)  =  C(L)  exp 

where  Zm(L)  is  the  peak  altitude  as  a  function  of  longitude  within  the  depletion, 

Zo  is  the  peak  height  in  the  background  F-layer, 

D(L)  is  the  height  scale  within  the  depletion, 

D0,  the  height  scale  in  the  background  ionosphere  outside  the  depletion, 

Dc  is  the  height  scale  at  the  center  of  the  depletion, 

C(L)  is  the  electron  density  scale  factor. 

The  expressions  for  these  parameters  are: 


z~zm(L) 

D(L) 


zm(L)  =  (zmax-  zc)  exp 


( 
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where  F  is  the  depletion  width,  measured  in  degrees  longitude,  and  p  is  the  steepness 
factor  affecting  the  transition  of  the  three  structural  parameters  from  depletion  to  the 
background  ionosphere. 
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Figure  8  shows  a  longitudinal  cut  through  the  band  which  is  assumed  uniform  in 
the  north/south  direction,  i.e.,  it  is  assumed  that  the  depletion  is  aligned  with  the 
magnetic  field. 


In  Figure  8,  L0=289  degrees,  Zo=440  km,  D=30  km,  F=0.45  km  and  p=6. 


Longitude  (E) 

Figure  8.  Two  dimensional  electron  density  distribution  along  an  east-west  line 
combining  the  background  ionosphere  described  earlier  with  a  model  of  the  depletion 
region.  The  contour  are  log  (electron  density). 

In  this  particular  model,  the  peak  electron  density  of  1.6xl06  cm'3  is  at  an  altitude  of  340 
km  in  the  background  ionosphere,  while  within  the  depletion,  the  peak  is  at  400  km  with 
a  density  of  2.0xl05  cm'3.  Although  the  parameters  of  the  background  ionosphere  are 
measured  by  the  vertical  soundings,  the  electron  density  profile  within  the  depletion  can 
only  be  modeled  consistent  with  the  observations  of  the  optical  imager  and  other 
ancillary  information.  Direct  measurement  by  the  sounder  within  the  depletion  is  not 
possible,  as  will  be  discussed  later. 

Using  these  electron  density  profiles  and  the  model  for  the  neutral  atmosphere, 
the  volumetric  emission  rate  of  630.0  nm  can  be  calculated  using  Equation  ( 1 ).  Then  the 
integrated  emission  can  be  calculated  for  each  ray  path  from  the  all-sky  imager  through 
the  emitting  volume. 
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4.  Analyses  of  Observed  Depletion  Bands 


4.1.  Observed  Depletion  Bands 

The  depletions  we  analyzed  were  recorded  during  the  field  campaign  conducted 
in  late  September  and  early  October,  1994  as  part  of  the  MISETA  program  by  Phillips 
Laboratory  using  the  HAARP  all-sky  imager.  The  observing  site  at  Agua  Verde  (25.5  S, 
70. 1W)  lies  almost  on  the  same  magnetic  field  line  as  the  Jicamarca  Ionospheric 
Observatory  on  the  magnetic  equator  in  Peru.  This  allows  the  incoherent  scatter  radar 
observations  at  Jicamarca  recorded  at  the  same  time  to  be  directly  linked  with  our 
observations. 

Only  the  nights  of  October  1  and  October  3,  1994  showed  depletion  activity. 
The  image  recording  started  shortly  after  sunset  (at  approximately  00  UT)  when  the  sky 
became  sufficiently  dark  to  detect  the  background  airglow  and  continued  until  the 
airglow  intensity  became  undetectable,  around  local  midnight,  when  the  background  F- 
region  ionization  had  sufficiently  decayed.  Figure  9  shows  the  all-sky  image  for  October 
1  at  xxxx  UT  using  the  630.0  nm  filter.  The  dark  band  is  the  depleted  region  viewed 
against  the  background  airglow  emission. 
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Figure  9.  All-sky  image  at  630.0  nm  made  at  Agua  Verde,  Chile  on  Oct.  1, 1994  at  0130 
UT.  Visible  are  two  dark  depletion  bands  extending  north-south  across  the  image. 
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4.2.  Analysis  of  the  Shape  and  Motion  of  the  Depletions  Bands 

One  of  the  most  obvious  characteristics  of  the  equatorial  depletions  is  their  west 

to  east  motion.  After  sunset  when  these  depletions  begin  to  form  as  bubbles  rising  over 
the  magnetic  equator,  they  begin  to  drift  eastward  as  a  result  of  a  vertical  electric  field  in 
the  F-region.  This  ExB  motion  drives  them  across  the  all-sky  image  at  velocities  of  the 
order  of  100  m/s. 

The  depletions  were  tracked  for  approximately  100  minutes  on  Oct.  1,  1994  and 
for  only  about  35  minutes  on  Oct.  3,  1994.  Depletions  only  appeared  on  these  two 
nights.  This  relatively  short  duration  was  caused  by  the  general  weakening  of  the 
background  emission.  As  discussed  earlier,  this  makes  the  depletions  almost  invisible  as 
the  background  emission  becomes  as  dark  as  the  depletion. 

Using  the  recorded  one  minute  sequence  of  optical  images  at  a  wavelength  of 
630.0  nm,  we  processed  a  cut  through  the  north-south  center  of  each  image,  averaging 
across  the  256  pixels  using  seven  rows  #125  through  #131  to  generate  an  smoothed  east- 
west  cut  through  the  image.  A  typical  cut  is  shown  in  Figure  10  where  we  see  two  low 
emission  depletion  bands,  one  in  the  western  sky  and  a  second  one  near  overhead. 


Figure  10.  All-sky  image  intensity  across  the  center  in  the  east- west  direction  for  Oct.  1, 
1994  at  0130  UT  (see  Figure  9). 

The  amplitudes  are  the  integrated  emission  intensity  measured  in  Rayleighs.  For 
the  data  collected  during  the  MISETA  campaign,  the  gain  of  the  intensifier  was  held  at  2 
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and  the  exposure  time  was  a  constant  4  seconds.  This  results  in  a  conversion  factor  of 
40  recorded  counts  =  1  Rayleigh.  With  these  images,  which  are  one  minute  apart,  the 
depletions  can  be  tracked  by  generating  a  contour  plot  of  emission  intensity  as  a  function 
of  pixel  number  and  time.  The  horizontal  cuts  are  stacked  vertically  as  a  function  of  time 
and  the  resulting  composite  plots  for  Oct.  1,  1994  are  shown  in  Figure  11.  We  see,  as  a 
function  of  time,  the  motion  of  the  depletions  from  west  to  east.  When  the  coordinates  of 
the  depletion  are  in  pixel  numbers,  there  is  a  distortion  caused  by  the  all-sky  lens,  making 
the  depletions  appear  narrower  when  they  are  far  from  the  center  of  the  image.  This 
effect  was  illustrated  earlier  in  Figure  7  where  the  horizontal  distances  and  velocity  are 
compressed  near  the  edges  of  the  image.  The  entire  time  sequence  is  shown  including 
those  times  when  the  depletions  are  near  the  edges  of  the  image.  The  "S"  shaped  dark 
low  intensity  bands  show  the  motion  of  the  depletions  as  they  move  across  the  all-sky 
image.  In  these  plots  it  is  also  possible  to  see  the  motion  of  individual  bright  stars  and 
the  milky  way  as  these  features  progress  from  east  to  west  as  the  earth  rotates. 
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Figure  11.  Composite  of  crosscuts  from  the  Agua  Verde,  630.0  ran  images  for  Oct.  1, 
1994  showing  the  motion  of  depletions  as  they  move  from  west  to  east.  The  origin  for  the 
time  axis  is  at  00UT. 
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As  part  of  this  analysis,  the  intensity  cuts  across  the  center  of  the  image  were 
again  smoothed  to  remove  the  noisy  characteristics  of  the  amplified  optical  signal  by 
fitting  these  data  points  with  a  cubic  spline  interpolation.  We  then  used  the  location  of 
its  minimum  to  determine  the  centers  of  the  depletion  bands,  and  we  determined  the 
widths  of  these  depletions  by  using  the  separation  of  the  inflection  points  on  both  sides  of 
the  minimum.  The  difference,  in  pixels,  between  the  two  inflections  was  then  converted 
to  the  width  of  the  depletion  in  kilometers  using  Equations  3  and  4. 

Fitting  these  cross-cut  data  with  the  cubic  spline  technique  and  searching  for  the 
local  minima  resulted  in  the  curves,  shown  in  Figure  12.  Each  curve  represents  the 
position  of  the  minimum  associated  with  each  depletion  of  the  four  depletions  as  they 
move  across  the  field  of  view  as  a  function  of  time.  The  time  origin  for  each  curve  is 
00  UT.  When  the  depletions  are  within  the  linear  region  of  the  optical  image,  the  slope 
of  the  curve  is  related  to  the  mean  drift  velocity  of  that  depletion. 


Figure  12.  Motion  of  the  depletion  centers  as  a  function  of  time. 

For  the  two  main  depletions  observed  on  October  1,  the  velocity  was  67  m/s 
averaged  over  their  observed  lifetime.  Only  relatively  small  noise-like  fluctuations  are 
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observed  in  the  velocity  of  these  two  depletions.  The  width  of  these  depletions  was,  on 
the  average,  12  0  km. 

The  geometrical  velocity  variation  discussed  in  the  previous  section  of  this  report 
is  not  apparent  on  these  plots,  since  the  scaled  data  were  restricted  to  the  more  central, 
linear  portion  of  the  image,  purposefully  excluding  those  times  when  the  180  degree  lens 
distortion  significantly  slows  the  apparent  motion  of  the  depletion,  as  was  seen  in 
Figure  7. 


4.3.  Model-based  Analysis 

Using  our  model  based  approach  we  produced  synthetic  image  cross-sections  that 
fit  as  closely  as  possible  the  cross-sections  obtained  the  actual  depletions  at  OOxx  and 
OOyy  UT  .  Figure  13a  shows  the  results  when  a  depletion  is  almost  overhead  with  the 
synthetic  cross-section  adjusted  and  shifted  to  align  with  it. 

The  underlying  physical  situation  for  the  model  adjusted  to  fit  the  empirical 
depletions  are  shown  in  Sections  b  and  c  of  Figure  13.  In  Section  b  we  show  the  two 
dimensional  structure  of  the  electron  density  required  to  achieve  the  fit  to  the  empirical 
data,  both  the  background  and  within  the  depletion  and  Section  c  shows  the  derived 
volumetric  emission  profile. 

These  two  physical  descriptions  constitute  the  end  product  sought  in  this  project. 
They  provide,  using  an  inverse  analysis  technique,  new  continuous  information  on  the 
volumetric  structure  of  the  depletions  available  in  optical  all-sky  images.  We  conducted 
a  quantitative  analysis  using  this  model-based  approach  to  produce  synthetic  image  cross- 
sections  that  fit  as  closely  as  possible  the  cross-sections  obtained  from  the  actual 
depletions. 


Figure  13. 

We  have  overlaid  the  actual  cross-sections  onto  the  calculated  emission  intensity  across 
the  center  of  the  image.  The  model  was  shifted  to  align  the  minimum  of  the 
measurements  with  the  minimum  of  the  simulated  emissions.  The  important  quantitative 
comparisons  are: 

1 .  The  background  intensity  level. 
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2.  The  minimum  intensity  within  the  depletion. 

3.  The  slope  of  the  transition  between  the  two  regions. 

The  background  airglow  level  was  first  matched  by  modifying  the  measured 
ambient  electron  density  profile  within  reasonable  bounds.  When  this  was  completed, 
attention  was  turned  to  the  structure  of  the  electron  density  depletion,  and  the  adjustable 
parameters  discussed  in  Section  3  are  varied,  again  in  a  reasonable  manner  to  match  the 
slopes  of  the  emission  profile,  and  most  importantly,  the  emission  level  within  the 
depletion.  The  program  resides  within  IBM  Data  Explorer  and  provides  a  set  of  “knobs” 
and  a  real-time  interactive  capability  to  accomplish  this  fitting  process. 

To  ensure  that  the  measured  intensity  at  the  darkest  part  of  the  depletion  is  an 
actual  emission  level  above  the  system-minimum-detectable  intensity,  several  images 
were  scanned  to  determine  the  distribution  of  measured  intensities.  The  important  result 
is  that  the  HAARP  imaging  system  seems  capable  of  measuring  down  to  approximately 
10  R  or  less  and  the  residual  emission  within  the  depletion  is  real  reaching  15  to  20  R. 


5.  Conclusions 

We  have  developed  new  analysis  tools  that  can  be  used  to  describe  the  structure 
and  motion  of  equatorial  depletion  bands.  We  are  able  to  process  and  analyze  relatively 
large  amounts  of  optical  all-sky  photometer  imagery,  particularly  in  this  case,  equatorial 
depletion  bands.  This  processing  technique  involves  smoothing  (using  spatial  averaging 
only)  and  curve  fitting,  making  it  possible  to  use  simple  mathematical  methods  to 
determine  the  speed  and  structure  sizes  of  the  depletion  bands. 

We  also  developed  a  simulation  technique  that  makes  it  possible  to  infer  some  of  the 
characteristics  of  the  3-D  structure  of  the  depletion  bands  based  on  the  2-D  optical 
images.  The  basic  parametric  electron  density  model  we  developed  on  the  Silicon 
Graphic  work  station  provides  for  real-time  control  of  the  parameters  and  for  visual 
feedback  effects  on  the  image  emission  intensity  profile,  either  as  a  full  2-D  all-sky 
image,  or  as  a  cross-cut  through  the  image  that  can  be  compared  with  the  smoothed 
empirical  image  data. 

It  appears  that  the  depletions  observed  on  the  two  nights  during  the  MISETA 
campaign  in  Chile,  away  from  the  magnetic  equator,  are  relatively  shallow.  Emission 
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within  the  depletion  is  only  a  factor  of  2  down  from  the  ambient  airglow.  This  may 
result  from  the  relatively  long  time  since  these  depletions  formed  and  to  fill-in  by 
diffusion.  But,  other  observations  have  shown  that  the  electron  density  within  these 
regions  is  highly  irregular  at  these  times  and  clearly  causes  scattering  and  scintillation  on 
radio  signals  passing  through  these  bands  (Sales  et  al.,  1996). 

At  this  point,  the  number  of  depletions  recorded  and  analyzed  is  very  small, 
making  it  difficult  to  draw  reliable  conclusions  concerning  the  structure  of  the  equatorial 
depletion  bands,  but  some  insight  has  already  been  gained.  Although  it  appears  that  the 
internal  shape  of  the  depletion  band  is  remarkably  stable  as  the  bands  drift  after  they  are 
formed,  we  have  observed  some  changes  on  October  3,  a  change  in  the  character  of  the 
spread-F  that  initially  was  associated  with  irregularities  within  the  depletion  band, 
ultimately  evolved  into  general  lower  F-region  spread-F,  as  the  band  “grew  old”. 

Several  areas  for  future  research  have  become  obvious.  Besides  collecting  and 
processing  additional  data  to  build  up  the  data  base,  we  ought  to  consider  an  adding  an 
optical  site  closer  to  the  magnetic  equator,  possibly  600  km  south,  to  catch  the  earlier 
development  of  these  bands  before  they  reach  a  stable  configuration.  It  is  during  these 
early  stages  that  the  irregularities,  observed  by  radio  wave  scattering,  are  formed.  The 
observations  at  an  intermediate  site  may  shed  light  on  the  mechanisms  for  the  formation 
and  maintenance  of  these  internal  irregularities  within  the  depletion.  An  important 
question  is  whether  analysis  of  the  structural  changes  within  the  depletion  band,  as 
observed  by  the  optical  system,  such  the  steepening  of  the  walls  affects  the  formation  of 
these  small  and  medium  scale  irregularities? 

The  general  approach  we  have  developed  in  this  program  for  getting  more 
information  out  of  the  optical  data  should  help  in  answering  these  questions. 
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Abstract 


A  photolytic  iodine  laser  is  shown  to  have  a  13.59  GHz  modulation  in  the  laser  radiation 
when  simultaneous  lasing  of  two  hyperfine  transitions  is  obtained.  The  necessary  gain  tuning 
required  to  achieve  dual  line  lasing  is  accomplished  through  the  use  of  a  magnetic  field. 
In  addition,  we  investigate  the  possibility  of  using  a  nonlinear  mirror  to  increase  the  gain 
coupling  between  the  two  transitions. 
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GIGAHERTZ  MODULATION  AND  ULTRAFAST  GAIN  BUILD  UP  IN 

IODINE  LASERS 

Jeff  Nicholson  and  Wolfgang  Rudolph 


1  Introduction 

Gas  lasers  have  received  attention  because  of  their  ability  to  operate  at  very  high  cw  power 
levels  with  good  beam  quality,  and  also  because  they  possess  an  active  media  that  is  not  as 
susceptible  to  damage  as  solid  state  materials.  One  example  of  high  power  gas  lasers,  the 
Chemical  Oxygen  Iodine  Laser  (COIL),  has  been  shown  to  operate  at  power  levels  as  high  as 
40  kW  [1]. 

Modulation  of  these  high  power  lasers  at  gigahertz  frequencies  is  desirable.  Not  only 
does  one  obtain  higher  peak  intensities  in  a  modulated  laser  beam,  but  one  can  think  of 
a  gigahertz  modulated  laser  beam  as  a  radar  wave  with  an  optical  carrier.  Thus,  one  can 
envision  producing  a  radar  signal  that  propagates  with  the  qualities  of  a  coherent  optical 
wave.  Modulating  a  laser  at  gigahertz  frequencies  with  active  elements  is  difficult  because  of 
the  high  bandwidth  requirements.  In  addition,  the  high  intracavity  powers  of,  for  example, 
COIL,  impose  extreme  damage  threshold  requirements  on  intracavity  elements. 

An  alternative  is  to  force  the  laser  to  operate  on  two  different  hyperfine  transitions  of 
atomic  iodine  simultaneously.  If  the  radiation  in  the  two  transitions  is  temporally  coherent, 
the  intensity  of  the  laser  will  be  modulated  at  a  frequency  equal  to  the  frequency  separation 
of  the  two  transitions. 

Furthermore,  previous  experiments  and  computer  simulations  have  shown  that  energy 
extraction  in  high  power  iodine  amplifiers  can  be  improved  by  amplifying  a  pulse  whose 
spectrum  consisted  of  two  separate  frequencies  coincident  with  the  two  strongest  transitions  of 
atomic  iodine  and  suggested  that  the  output  of  such  an  amplifier  would  indeed  be  modulated 
at  gigahertz  frequencies  [2,  3]. 

An  atomic  iodine  laser  operates  on  the  2Pi/2  — 2  P3/2  magnetic  dipole  transition  of  atomic 
iodine,  with  a  wavelength  of  1.315  /zm  (see  Fig.  1).  The  strongest  of  its  six  hyperfine  tran¬ 
sitions,  and  the  one  that  typically  lases  is  the  F'  =  3  to  Fn  =  4.  The  gain  of  the  second 
strongest  transition,  the  F'  =  2  to  F "  =  2,  is  more  than  twice  as  small  as  that  of  the  3-4. 
The  two  transitions  are  separated  by  13.59  GHz. 

Applying  a  magnetic  field  to  atomic  iodine  causes  Zeeman  splitting  of  the  energy  levels, 
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Figure  1:  Energy  levels  and  allowed  transitions  of  atomic  iodine.  The  numbers  on  the  right 
side  of  the  diagram  are  the  designations  used  in  the  computer  simulations. 

removing  their  degeneracies.  For  small  magnetic  fields  (<  Kilogauss)  the  frequencies  of  the 
hyperfine  transitions  are  not  shifted  significantly  [4].  For  this  reason,  the  transitions  are 
still  referred  to  as  the  3-4  and  2-2  transitions,  even  under  the  influence  of  a  magnetic  field. 
The  gain  of  the  two  transitions  can  be  equalized  without  the  need  of  intracavity  elements 
by  applying  a  magnetic  field  of  approximately  350  Gauss  perpendicular  to  the  polarization 
of  the  laser  [5].  With  a  smaller  magnetic  field,  the  3-4  transition  dominates,  with  a  stronger 
field,  the  2-2. 

2  Experimental  Results  and  Discussion 

In  this  paper,  we  show  the  modulation  in  the  laser  radiation  that  results  from  the  simultaneous 
lasing  of  the  2-2  and  3-4  transition  in  an  iodine  laser.  The  experiments  were  performed  with 
a  photolytic  iodine  laser,  a  convenient  tool  for  performing  small  scale  experiments  which  can 
later  be  scaled  up  to  the  high  power  COIL.  A  sketch  of  the  experimental  apparatus  is  shown 
in  Fig.  2.  The  laser  consisted  of  a  cell  of  CF3I  at  8  Torr  pumped  by  a  KrF  excimer  laser 
producing  250  mJ  in  20  ns  pulses  at  248  nm.  As  a  result  of  the  absorption  at  248  nm,  CF3I 
dissociates  into  CF3  and  iodine,  with.  92%  of  the  iodine  produced  in  the  excited  state  (I*)  [6]. 

An  aperture  in  the  cavity  forced  oscillation  on  a  single  transverse  mode.  At  8  torr  iodine 
is  primarily  Doppler  broadened  with  a  Doppler  width  of  approximately  250  MHz  [7]  and  the 
free  spectral  range  of  the  cavity  was  150  MHz,  therefore,  the  laser  operated  on  either  one  or 
two  longitudinal  modes.  A  typical  gain  switched  pulse  is  shown  in  the  inset  of  Fig.  2.  The 
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Figure  2:  The  experimental  setup.  Inset:  a  typical  gain-switched  iodine  laser  pulse  showing 
longitudinal  mode  beating. 

fast  beating  shown,  with  a  period  of  approximately  6  ns,  is  longitudinal  mode  beating. 

A  set  of  ferrite  ceramic  permanent  magnets  was  used  to  apply  a  magnetic  field  to  the 
gain  cell  perpendicular  to  the  polarization  of  the  laser.  The  strength  of  the  magnetic  field 
was  tuned  by  varying  the  distance  between  the  magnets  and  the  gain  cell.  This  produced  a 
magnetic  field  of  up  to  a  Kilogauss  that  varied  less  then  10%  over  the  mode  volume  of  the 
iodine  laser. 

The  single  shot  spectrum  of  the  iodine  pulse  was  measured  with  a  Fabry-Perot  interfer¬ 
ometer.  The  free-spectral-range  of  the  interferometer  was  39  GHz,  and  the  resolution  was 
approximately  2  GHz  at  1.315  pm.  Therefore,  while  the  interferometer  was  capable  of  distin¬ 
guishing  the  hyperfine  transitions,  it  was  not  able  to  resolve  individual  longitudinal  modes. 
A  cylindrical  lens  was  used  to  focus  into  the  Fabry-Perot  and  produce  line  fringes. 

The  temporal  profile  of  the  iodine  laser  pulses  was  recorded  with  a  streak  camera  with 
a  resolution  of  5  ps.  When  used  in  conjunction  with  the  Fabry-Perot  interferometer,  the 
streak  camera  was  able  to  time-resolve  the  spectral  content  of  the  laser  pulse.  This  provided 
details  about  the  temporal  overlap  of  the  laser  radiation  due  to  the  two  different  hyperfine 
transitions. 

Because  the  cameras  were  not  sensitive  in  the  infrared,  a  LHO3  crystal  was  used  to 
frequency  double  the  laser  radiation  to  657.5  nm.  Both  intracavity  and  extra-cavity  doubling 
was  investigated.  Extra-cavity  doubling  had  the  advantage  of  ease  of  alignment.  Placement 
of  the  crystal  in  the  cavity,  however,  becomes  important  when  considering  the  effect  of  the 
nonlinear  crystal  in  conjunction  with  the  laser  resonator  outcoupler  to  create  a  nonlinear 
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Figure  3:  Fabry-Perot  fringes  without  magnetic  field  (solid  line)  and  with  a  400  G  magnetic 
field  (dashed  line)  applied  to  the  active  medium. 

mirror  as  explained  below. 

Fig.  3  shows  the  Fabry-Perot  fringes  obtained  from  the  interferometer.  The  solid  curve 
shows  the  fringes  due  to  the  3-4  line  when  there  was  no  magnetic  field  on  the  gain  cell.  The 
dotted  curve  shows  the  fringes  obtained  when  a  magnetic  field  of  approximately  400  Gauss 
was  applied  to  the  gain  cell,  perpendicular  to  the  laser  oscillation.  Two-line  lasing  is  clearly 
visible.  Because  of  the  second  harmonic  generation,  the  doubled  2-2  line  appears  27  GHz 
below  the  doubled  3-4  line.  The  sum  frequency  occurs  halfway  between  the  doubled  3-4  and 
2-2  lines. 

The  temporal  dependence  of  the  two  laser  lines  for  two  typical  pulses  is  shown  in  Fig.  4. 
The  solid  curves  show  the  3-4  line  and  the  dashed  curves  the  2-2. 

The  two  curves  represent  typical  pulses  observed.  Because  of  the  single  shot  nature  of  the 
laser,  and  because  no  attempt  was  made  to  stabilize  the  excimer  pump  or  the  iodine  laser  in 
any  way,  a  certain  amount  of  variation  in  the  laser  pulses  from  shot  to  shot  was  observed. 
Both  lines  tended  to  operate  with  either  one  or  two  longitudinal  modes  an  equal  amount  of 
time  and  independent  of  the  magnetic  field  strength.  In  addition,  when  the  magnetic  field 
was  such  that  both  hyperfine  transitions  lased,  there  was  a  variation  in  the  temporal  overlap 
of  the  two  transitions  from  shot  to  shot.  Typical  shot  to  shot  fluctuations  in  the  iodine  laser 
pulse  energy  were  less  than  10%. 

Although  the  2-2  and  the  3-4  transitions  do  not  share  any  energy  levels  (see  Fig.  1), 
there  are  fast  relaxations  among  the  upper  and  lower  hyperfine  levels  that  tend  to  equalize 
the  populations.  Population  in  the  upper  hyperfine  levels  is  equalized  through  collisions 
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Figure  4:  Temporal  dependence  of  the  3-4  (solid  line)  and  the  2-2  (dashed  line)  transitions 
as  measured  by  the  streak  camera  for  two  different  laser  pulses. 

with  ground  state  iodine  and  can  take  place  with  a  time  constant  on  the  order  of  tens  of 
nanoseconds;  relaxation  among  the  lower  hyperfine  levels  proceeds  more  directly  through 
collisions  with  any  species  in  the  gain  cell,  and  has  a  time  constant  of  a  few  nanoseconds  [8]. 
If  the  gain  of  the  two  transitions  is  not  exactly  equal,  the  stronger  will  build  up  first,  and 
collisional  relaxations  will  remove  the  population  from  the  weaker,  effectively  decreasing  the 
gain  of  the  weaker  transition  even  further.  Any  small  change  in  the  magnetic  field  away  from 
the  crossing  point  in  the  gain  of  the  transitions  leads  to  one  of  the  transitions  dominating 
over  the  other  [9].  This  meant  that  simultaneous  lasing  of  both  transitions  was  only  seen  in 
a  narrow  range  of  magnetic  field  strength.  Strong  lasing  of  both  transitions  was  observed 
from  approximately  360  to  390  Gauss  (corresponding  to  moving  the  permanent  magnet  0.5 
cm).  However,  within  this  region,  there  was  always  a  certain  amount  of  temporal  overlap 
because  the  relaxations  among  hyperfine  levels  forced  the  gain  of  the  two  transitions  to 
be  approximately  equal  in  order  to  observe  lasing  of  both  transitions.  Fig.  4a  depicts  the 
approximate  minimum  amount  of  temporal  overlap  observed.  Fig.  4b  shows  a  more  typical 
case.  Although  the  2-2  transition  is  the  first  to  build  up  in  both  pulses  of  Fig.  4,  this  could  be 
changed  by  tuning  the  magnetic  field  strength,  and  pulses  where  the  3-4  transition  dominated 
were  also  observed  by  simply  adjusting  the  position  of  the  permanent  magnet. 

In  Fig.  4a,  the  3-4  transition  shows  beating  from  multiple  longitudinal  modes,  whereas 
the  2-2  transition  has  only  a  single  longitudinal  mode.  In  contrast,  both  the  3-4  and  the 
2-2  transitions  show  longitudinal  mode  beating  in  Fig.  4b.  Fig.  4  illustrates  an  inherent 
problem  in  producing  the  desired  13.6  GHz  modulation.  In  order  to  have  a  maximum  depth 
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Figure  5:  Gigahertz  mode  beating  of  the  laser  output. 

of  modulation  over  the  entire  length  of  the  laser  pulse,  the  two  frequencies  must  overlap 
temporally  and  be  of  equal  intensity.  In  Fig.  4a,  the  temporal  overlap  of  the  3-4  and  2-2  is 
only  partial.  In  the  case  of  Fig.  4b,  even  though  the  envelopes  of  the  two  transitions  overlap, 
longitudinal  mode  beats  in  each  of  the  transitions  do  not.  Therefore,  this  pulse  would  not 
show  a  substantial  spectral  component  at  13.6  GHz. 

When  the  magnetic  field  was  adjusted  for  optimum  temporal  overlap  of  the  3-4  and  2-2 
transitions,  and  the  pulse  is  viewed  with  a  fast  streak  camera  with  a  resolution  of  5  ps,  a 
clear  13.6  GHz  modulation  is  observed,  as  is  shown  in  Fig.  5.  This  data  was  taken  with  the 
second  harmonic  generation  performed  extra-cavity. 

Another  possibility  to  record  the  GHz  modebeating  is  the  mixing  of  the  optical  signal 
with  a  local  oscillator.  To  this  aim  we  detected  the  second  harmonic  of  the  iodine  laser  with 
a  20  ps  photodiode.  Since  the  diode  is  a  quadratic  detector  the  current  exhibits  the  13.6 
GHz  modulation.  This  current  was  mixed  with  a  local  oscillator  tunable  around  13.6  GHz  so 
that  the  resulting  difference  frequency  could  be  detected  directly  by  a  400  MHz  oscilloscope. 
Figure  locoscill  shows  an  oscilloscope  trace.  The  laser  operated  at  two  longitudinal  modes  on 
each  hyperfine  transition  which  explains  the  modulation  of  the  beat  amplitude. 

3  Potentials  of  a  nonlinear  mirror  for  mode  coupling 

The  13.6  GHz  modulation  is  observed  without  any  extra  coupling  between  the  3-4  and  2-2 
transitions.  However,  because  of  the  above  mentioned  shot  to  shot  variation  in  the  overlap  of 
the  two  transitions  and  to  ease  the  requirements  of  tuning  the  magnetic  field,  an  additional 
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Figure  6:  Signal  after  mixing  the  output  of  a  20  ps  photodiode  illuminated  by  the  second 
harmonic  of  the  iodine  laser  with  a  local  oscillator  tuned  to  14  GHz. 

coupling  between  the  transitions  is  desirable.  The  possibility  of  using  a  nonlinear  mirror  to 
equalize  the  gain  of  the  two  transitions  was  investigated.  A  schematic  of  the  mirror  is  shown 
in  the  inset  of  Fig.  7.  Such  a  mirror  was  first  used  to  mode-lock  a  Nd:YAG  laser  [10].  The 
nonlinear  mirror  consists  of  a  second  harmonic  crystal,  and  a  dichroic  mirror  as  the  outcoupler 
for  the  resonator.  Typical  reflectivities  for  the  outcoupler  are  90%  for  the  fundamental 
frequency  and  100%  for  the  second  harmonics.  The  distance  between  the  mirror  and  the 
second  harmonic  crystal  is  such  that  the  difference  in  the  index  of  refraction  for  fundamental 
and  sum  frequencies  introduces  a  n  phase  change  between  the  two  upon  re-entrance  into  the 
crystal.  The  n  phase  change  means  that  after  converting  IR  to  red  in  the  first  pass  through 
the  nonlinear  crystal  and  reflecting  off  the  outcoupler,  the  red  is  converted  back  to  IR  in  the 
second  pass  through  the  crystal.  The  result  is  an  energy  transfer  from  the  stronger  laser  line 
to  the  weaker. 

A  computer  simulation  was  developed  in  order  to  investigate  the  degree  of  effective  gain 
coupling  possible  with  the  nonlinear  mirror.  The  computer  simulation  was  based  on  the  usual 
rate  equations  for  describing  the  interaction  with  the  gain  medium  and  the  outcoupler  [11], 

^-N4  =  -alFl(N4-N2)  +  -(N3-N4), 

dt 

4 -N2  =  CT1F1(N4-N2)  +  -(N1-N2), 

at  Ti 

^-N3  =  -a2F2  (N3  -  Nx)  -  -  (JV3  -  N4) , 

(it 
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(1) 

(2) 

(3) 


Figure  7:  Simulation  of  the  nonlinear  mirror.  Solid  curves  are  the  2-2  and  3-4  (stronger) 
transitions  without  the  nonlinear  mirror.  Dashed  curves  are  with  the  nonlinear  mirror.  The 
inset  shows  a  schematic  of  the  mirror.  Reflectivities  for  the  dichroic  outcoupler  are  90%  for 
the  IR  and  100%  for  the  red. 

JtNx  =  a2F2(N3-N1)-^(N1-  N2) ,  (4) 

jFx  =  a1cF1  (N4  -  N2)  -  (5) 

jtF2  =  <j2cF2(Nz-Nx)-y,  (6) 

where  <7i  and  a2  are  stimulated  emission  cross  section  for  the  3-4  and  2-2  transitions,  F\ 
and  F2  are  the  photon  flux  of  the  3-4  and  2-2  transitions,  ru  and  77  are  the  time  constants 
for  relaxation  among  upper  and  lower  hyperfine  levels,  and  rr  is  the  resonator  lifetime.  The 
various  populations  N{  refer  to  the  designations  on  the  right  side  of  Fig.  1.  Because  of  the 
short  pump  pulse,  the  simulation  is  started  with  the  pumping  process  completed  and  the 
population  already  in  the  excited  state.  In  addition,  the  time  scale  of  the  iodine  laser  pulse 
was  much  faster  than  any  chemical  kinetics  in  the  active  medium  (such  as  quenching  of  I*  due 
to  the  formation  of  I2)  [12]  and  all  such  chemical  processes  were  ignored  in  the  rate  equation 
model.  As  in  other  rate  equation  models  of  atomic  iodine  lasers,  the  F"  =  2  and  F"  =  1 
levels  are  lumped  into  the  lower  laser  levels  because  of  the  fast  relaxation  among  the  lower 
hyperfine  levels  [13].  The  model  assumed  the  laser  operated  on  a  single  longitudinal  mode. 

Between  two  time  steps  (the  resonator  round  trip  time)  the  photon  flux  was  modified  to 
account  for  the  interaction  with  the  nonlinear  mirror.  The  interaction  of  the  laser  radiation 
with  the  LiIC>3  crystal  was  modeled  using  the  coupled  wave  equations  for  second  harmonic 
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and  sum  frequency  generation.  The  equations  for  the  complex  field  amplitude  are  [14] 


d  A 

dzAl 

=  Mi  Am  +  MsfAj, 

(7) 

d  ~ 

5* 

=  M2A.D2  +  M5M1, 

(8) 

j-Adi 

dz 

=  Mi, 

(9) 

d  - 

-7-AD  2 
dz 

=  Ma, 

(10) 

d  A 

-j-Asf 

dz 

=  MiA2. 

(11) 

The  subscripts  1  and  2  again  refer  to  the  3-4  and  2-2  transitions,  D1  and  D2  refer  to  the 
doubled  frequencies,  and  SF  to  the  sum  frequency,  fj,  is  a  nonlinear  coefficient,  proportional 
to  the  nonlinear  susceptibility  x The  actual  equations  used  in  the  computer  program  were 
the  separate  equations  for  the  real  field  amplitude  A  and  phase  (f>  (see,  for  example,  [15]), 
where  A  =  Ae 

The  simulation  first  calculated  the  effect  of  the  gain  media  using  Eqs.  (1)  -  (6),  starting 
from  spontaneous  emission.  Then  the  first  passage  through  the  nonlinear  crystal  was  com¬ 
puted  using  Eqs.  (7)  -  (11).  At  this  point,  part  of  the  energy  at  the  fundamental  frequencies 
(approximately  10%)  was  coupled  out  of  the  cavity,  and  an  additional  phase  change  was 
introduced  between  the  fundamental  and  second  harmonic  frequencies.  The  second  passage 
through  the  crystal  was  then  calculated.  Because  the  second  harmonic  and  sum  frequency 
were  not  resonated,  any  remaining  energy  in  the  red  was  lost  to  the  laser.  For  this  reason, 
when  the  phase  change  after  the  first  passage  through  the  nonlinear  crystal  was  set  to  2n,  for 
high  enough  values  of  fi  the  laser  was  driven  below  threshold  and  was  shut  off  prematurely. 

Results  from  the  simulation  are  summarized  in  Fig.  7.  The  solid  lines  show  the  temporal 
dependence  of  the  2-2  and  3-4  transitions  when  there  is  no  nonlinear  crystal  in  the  cavity 
and  the  gain  cross  section  of  the  2-2  transition  is  90%  of  the  3-4  transition.  The  dashed  lines 
show  the  profiles  when  the  crystal  is  placed  in  the  cavity.  In  both  cases,  the  smaller  pulse 
is  the  2-2  and  the  larger  is  the  3-4.  All  curves  have  been  normalized  to  the  peak  of  the  3-4 
pulse  obtained  with  no  nonlinear  mirror.  The  case  with  the  nonlinear  mirror  shows  a  relative 
increase  in  the  2-2  profile  compared  to  the  3-4,  demonstrating  the  coupling  ability  of  the 
nonlinear  mirror. 

The  expected  depth  of  the  hyperfine  mode  beating  is  given  by  the  visibility  of  the  gigahertz 
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Figure  8:  The  visibility  of  the  GHz  mode  beating  as  a  function  of  time  for  different  nonlinear 
coefficients. 

modulation  in  the  total  photon  flux: 

y/.\  F maz(t)  F min  (t)  _  2y/F1(t)F3(t) 

U  Fmax(t)  +  Fmin(t)  F1(t)  +  F2(t)'  [  1 

where  the  total  photon  flux  is  given  by  F(t)  oc  |i4i(t)  +  A2(t)\2.  Fmax(t )  and  Fm:n(t)  are  the 
maximum  and  minimum  values  obtained  at  time  t  due  to  the  gigahertz  modulation.  V(t)  is 
plotted  in  Fig.  8  for  several  values  of  the  nonlinear  coefficient  /i.  With  an  increasing  nonlinear 
coefficient,  the  visibility  of  the  fringes  remains  near  the  maximum  value  of  one  in  the  tail 
of  the  pulse,  meaning  the  depth  of  modulation  remains  close  to  100%  for  the  length  of  the 
pulse.  The  peak  value  of  /z  =  0.005  corresponds  to  a  single  pass  conversion  efficiency  from 
IR  to  red  of  approximately  80%. 

It  was  demonstrated  that  the  nonlinear  mirror  can  increase  the  coupling  between  the  two 
hyperfine  transitions  in  our  pulsed  iodine  laser.  In  a  cw  iodine  laser,  such  as  a  flashlamp 
pumped  iodine  laser  or  a  COIL,  the  possibilities  of  the  nonlinear  mirror  can  be  extended 
even  further.  First,  one  could  use  the  simple  photolytic  iodine  laser  described  above  to 
generate  a  seed  pulse  for  a  COIL.  This  way,  a  magnetic  field  can  be  used  to  tune  the  gain 
of  the  COIL  to  the  region  of  optimal  overlap  of  the  2-2  and  3-4  transitions.  The  seed 
pulse  can  then  provide  the  maximum  depth  of  modulation  without  any  intracavity  elements 
in  the  COIL.  In  a  cw  flashlamp  pumped  iodine  laser,  the  nonlinear  mirror  could  be  used 
for  both  coupling  the  hyperfine  transitions  together  and  mode-locking  of  the  longitudinal 
modes.  Mode-locking  occurs  in  the  nonlinear  mirror  because  longitudinal  modes  in  the  first 
pass  through  the  nonlinear  crystal  undergo  sum  frequency  generation,  and  on  the  second 
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pass  the  difference  frequency  generation  produces  new  modes  separated  by  the  longitudinal 
mode  spacing  and  with  the  phase  locked  to  the  original  modes  [15].  Thus,  not  only  is  the 
peak  intensity  enhanced  even  further,  but  it  is  expected  that,  in  a  manner  analogous  to 
mode-locking  longitudinal  modes  under  a  single  transition,  the  nonlinear  mirror  will  lock  the 
longitudinal  modes  of  both  hyperfine  transitions  together,  eliminating  the  problem  shown  in 
Fig.  4b.  A  planned  extension  to  the  computer  simulation  to  check  this  assumption  is  to  allow 
more  than  one  longitudinal  mode  to  oscillate. 

In  conclusion,  we  have  demonstrated  an  intrinsic  gigahertz  modulation  that  occurs  in 
an  atomic  iodine  laser.  The  gain  of  two  different  hyperfine  transitions  can  be  equalized 
through  the  use  of  a  magnetic  field,  allowing  them  to  lase  simultaneously  and  beat  together. 
Additional  coupling  of  these  transitions  is  desired  to  achieve  consistent  operation  and  can  be 
achieved  through  a  nonlinear  mirror  based  on  intracavity  sum  frequency  generation. 
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Inversion  of  Atmospheric  Radiance  Measurements  for  the  Measurement  of 
Temperature,  Turbulence,  and  Velocity 

Abstract 

Obtaining  atmospheric  parameters  from  radiance  measurements  consists  of  ill-conditioned 
problems  that  require  appropriate  inversion  methods.  This  report  analyzes  the  solution  methods 
for  atmospheric  temperature,  turbulence,  and  velocity  profiles  from  the  inversion  of  multi-spectral 
radiance  measurements.  The  following  inversion  methods  are  presented  and  evaluated  for  the 
attainment  of  atmospheric  parameters:  direct  linear,  direct  non-linear,  Tikhonov-Miller, 
Generalized  Singular  Value  Decomposition  (GSVD),  Truncated  Singular  Value  Decomposition 
(TSVD),  and  Damped  Singular  Value  Decomposition  (DSVD).  The  last  five  are  a  special  breed 
of  inversion  techniques  known  as  regularization  methods.  These  methods  can  handle  ill- 
conditioning  well  if  a  suitable  regularization  parameter  is  chosen.  Several  methods  are  evaluated 
for  choosing  this  parameter,  including  the  L-Curve  criterion.  Generalized  Cross  Validation 
(GCV),  Quasi-Optimality  criterion,  and  the  Discrepancy  principle. 

The  results  from  numerical  simulations  using  all  these  techniques  are  analyzed.  The 
regularization  methods  are  the  most  promising  inversion  techniques  for  this  application  because 
the  solutions  are  more  stable  than  the  direct  methods,  and  because  it  reduces  output  error,  given 
an  appropriate  regularization  parameter.  Inversions  for  temperature  using  the  Quasi-Optimality 
criterion  proved  to  be  more  successful  than  the  other  parameter  selection  techniques.  Solutions 
for  atmospheric  turbulence,  however,  had  very  limited  success.  Therefore,  solving  for  velocity 
profiles,  which  depends  upon  turbulence  parameter  results,  is  not  feasible  and  thus  was  not 
studied. 
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1.  Introduction 

Passively  emitted  atmospheric  radiance  in  the  lower  atmosphere  (below  30  km)  depends  on  the 
composition  of  the  gas  phase,  the  concentrations  of  aerosols,  cloud  cover,  precipitation,  and  the 
local  temperature.  The  composition  of  the  atmosphere  varies  dramatically  in  the  vertical 
direction;  the  density  and  temperature  diminish  causing  different  chemical  species  to  be  present  in 
different  proportions.  In  the  lateral  direction ,  the  effects  of  large  weather  systems  and 
topography  cause  spatial  variations  in  the  structure  of  the  atmosphere  over  lateral  scales  on  the 
orders  of  ten  to  hundreds  of  kilometers.  In  addition  to  these  large  scale  variations,  there  are  local 
structural  variations  in  the  atmosphere  caused  by  a  variety  of  dynamic  processes  and  commonly 
grouped  together  under  the  name  of  turbulence. 

Turbulent  motion  in  the  atmosphere  occurs  on  scales  of  millimeters  to  hundreds  of  meters;  this 
motion  results  from  the  breakdown  of  large  scale  coherent  motion  in  the  atmosphere  (such  as 
thermal  plumes  or  gravity  waves)  into  randomly  oriented  vortical  motions  often  referred  to  as 
eddies.  The  effect  of  turbulence  is  to  enhance  the  transfer  of  heat,  mass,  and  momentum  from  the 
surface  into  the  atmosphere  and  to  enhance  mixing  of  aerosols  throughout  the  atmosphere.  As  a 
consequence  of  this  activity,  the  temperature  field  (and  to  a  lesser  extent  the  chemical 
concentration  fields)  take  on  a  spatially  random,  self-similar  structure  with  a  range  of  scales.  As  a 
result  of  the  thermal  structural  variations,  the  atmospheric  radiance  and  the  local  refractive  index 
both  display  variations  characterized  by  the  length  scale  of  the  temperature  fluctuations.  The 
physical  nature  of  atmospheric  radiation  has  been  the  subject  of  much  research,  and  resulting 
models  have  been  incorporated  into  the  LOWTRAN,  MODTRAN,  HITRAN,  and  FASCODE 
atmospheric  radiance  codes.  These  codes  are  designed  to  predict  the  absorptance  and  radiance 
with  moderate  to  very  high  spectral  resolution  and  to  account  for  a  wide  range  of  atmospheric 


15-2 


conditions,  scattered  sunlight,  or  other  ambient  factors.  These  models  treat  the  atmosphere  as  a 
relatively  small  number  of  layers,  each  having  different  properties.  Experimentally,  atmospheric 
radiance  is  measured  using  filtered  radiometers  or  Fourier  Transform  Spectrometry  (FTS). 

The  physics  of  atmospheric  radiation  and  the  techniques  of  radiation  measurement  are  well 
understood,  but  the  use  of  radiation  for  the  determination  of  atmospheric  parameters  is  not.  One 
motivation  for  developing  suitable  techniques  is  the  reduction  in  cost  of  such  devices  as  windshear 
detectors.  A  passive  device  would  be  less  expensive  than  the  lasers  and  radar  currently  used  in 
windshear  detectors.  There  is  an  infrared  device  made  by  Turbulence  Prediction  Systems  in 
Colorado  that  uses  radiation  to  detect  large  temperature  gradients  in  front  of  an  aircraft, 
associating  those  with  large  velocity  changes.  This  device,  however,  does  not  explicitly  solve  for 
turbulence  or  velocities.  The  difficulty  associated  with  obtaining  these  parameters,  as  well  as 
temperature,  is  that  the  radiative  nature  of  the  atmosphere  is  similar  for  different  wavelengths  and 
because  the  radiation  signal  can  be  heavily  attenuated.  This  results  in  ill-conditioned  mathematical 
problems.  The  theory  and  simulations  associated  with  this  report  were  carried  out  in  hopes  of 
overcoming  these  problems  and  accurately  determining  atmospheric  temperature,  turbulence,  and 
velocity  profiles. 
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2.  Theory  and  Modeling 

The  formation  of  radiance  images  can  be  modeled  by  considering  an  imaging  radiometer 
staring  into  the  atmosphere.  The  image  plane  of  the  radiometer  has  an  array  of  detectors,  and  the 
detector  has  an  imaging  volume  defined  by  the  imaging  lens,  aperture,  and  detector.  The 
atmosphere  within  the  imaging  volume  can  be  considered  to  be  a  series  of  parallel  slices 
perpendicular  to  the  optic  axis.  The  composition  (and  therefore  the  radiative  properties)  of  the 
atmosphere  can  vary  along  the  view  axis,  but  it  should  be  essentially  constant  within  an  individual 
slice.  The  temperature  field  may  also  vary  significantly  along  the  view  axis.  In  addition, 
temperature  fluctuations  due  to  turbulence  are  present  within  an  individual  slice. 

Planck’s  Law  is  the  fundamental  equation  for  relating  radiant  energy  and  temperature.  It  gives 
the  radiance  intensity  B  at  wavelength  X  for  a  black  body  at  temperature  T,  where  c,  and  c2  are 
constants. 

B(X,  T )  =  c,  X'5  exp^  -  lj  ( 1 ) 


The  constituent  species  in  each  differential  slice  emit  a  fraction  of  the  black  body  intensity.  If  the 


combined  spectral  absorptivities  of  each  species  is  a,  and  assuming  a  is  equal  to  the  emissivity, 


then  the  intensity  I  from  passively  emitted  radiation  in  an  imaging  volume  striking  a  detector  is 


given  by 


Ix{x,y)  =  \B(X,T(x  j,y  j,z))a(X,z)expl-\a(X,z')dz' 


k iz 


(2) 


where  x  and  y  are  the  coordinates  of  a  particular  detector  cell  in  the  array  of  detectors,  z  is  the 
optical  axis  distance  through  the  imaging  volume,  and  F  is  the  focal  length  of  the  imaging  lens  of 
the  radiometer.  The  exponential  term  is  the  attenuation  of  the  radiance  image  from  the  absorption 
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of  the  atmosphere.  Scattering  effects  are  ignored.  Equation  2  can  be  linearized  with  a  Taylor 
series  about  an  assumed  or  first  guess  temperature  profile.  If  T0(z)  is  the  assumed  profile,  and  if 


K(X,z)  =  —  a(A,z ) exp  -  f a(X,z')dz' 

ft  T- T.M  V  J0 


then  equation  2  becomes 


-  TAx^,y^,!)]dz=  \KiX,z)T(X^,y^,z)dz  (4) 
o  r  r  r=T„(z)J  t  t  o  t  F 


F1'  F 


A  radiometer  can  measure  h  at  multiple  wavelengths  (about  30).  Databases  such  as 
MODTRAN  can  give  a(X,z') at  a  high  spectral  resolution  and  at  distinctive  atmospheric 
conditions.  Therefore,  the  left  hand  side  consists  of  known,  or  measured,  quantities.  The  spatial 
fluctuations  in  temperature  produce  corresponding  spatial  fluctuations  in  the  measured  radiance. 
Assuming  the  temperature  fluctuations  are  small  compared  to  the  mean  temperature,  equation  2 
becomes 

oo 

SIi(x,y)  =  jK(A,z)ST(xj:,A,z)dz  (5) 

o  r  r 

where  the  fluctuation  quantities  are  relative  to  the  mean.  The  structure  function  developed  by 
Kolmogorov  can  be  used  to  relate  the  temperature  fluctuations  to  turbulent  intensity6.  The 
turbulence  of  a  convective  medium  can  be  quantified  by  the  temperature  structure  parameter  Cr . 
If  the  distance  between  any  two  arbitrarily  spaced  points  is  r,  then  the  ensemble  average  of  the 
temperature  fluctuations  can  be  represented  by 

(N')f)  =  Cjr;  (6) 


15-5 


As  the  turbulence  intensity  increases,  then  by  equation  5  the  radiation  fluctuations  will  increase  in 
magnitude.  The  temperature  difference  of  a  point  relative  to  the  mean  temperature  at  any  z  is 
approximately  equal  to  the  average  of  all  the  differences  in  temperature  between  any  two  points  at 
the  same  z.  Thus  equation  5  becomes 

(sil)s]cl(z)M&W4  *  (7> 

0 

where  w  is  the  width  of  a  slice  at  optic  axis  distance  z. 

Equation  4  provides  a  basis  for  obtaining  temperature  profiles  from  the  radiance 
measurements,  whereas  equation  7  provides  a  basis  for  obtaining  temperature  structure  parameter 
profiles,  which  characterize  turbulence,  from  the  radiance  fluctuations. 

The  effect  of  the  ambient  fluid  velocity  field  for  short  time  intervals  can  be  examined  in  the 
context  of  Taylor’s  “frozen  turbulence”  assumption.  This  hypothesis  states  that  the  small  scale 
features  of  an  atmospheric  turbulent  flow  field  change  slowly  compared  to  the  mean  flow3.  So  the 
temperature  field  deforms  slowly  with  respect  to  the  overall  velocity.  Therefore,  over  a  short 
time  interval  t,  the  temperature  fluctuations  evolve  as  ST(x,y,z,t)=ST(x+Ur,y+VT,z,t+r),  where  U 
and  V  are  the  velocities  normal  to  the  optic  axis.  In  addition  to  the  frozen  turbulence  assumption, 
it  is  assumed  that  the  motion  parallel  to  the  optic  axis  results  in  a  negligible  magnification  change. 
Based  on  these  assumptions,  the  short-time  evolution  of  the  radiance  images  depends  only  on  the 
motion  normal  to  the  optic  axis.  The  effect  of  the  velocity  components  can  be  seen  explicitly 
from  the  cross-correlation  of  two  radiance  images,  7i,x  and  I%x,  separated  by  a  small  time  interval. 
The  cross-correlation  function  R  is  given  by 

oo  oo 

rxji  («> v)  =  f  J  SI\.x  (*»  >o 81  ^  (* + u>y + v)  ^ 
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where  u  and  v  are  the  spatial  shifts  in  the  detector  plane.  The  velocity  at  any  location  z  along  the 
optic  axis  has  an  apparent  magnitude  when  viewed  in  the  image  plane  that  is  scaled  by  the 
magnification  Flz.  The  displacement  of  an  object  at  a  location  z  will  move  across  the  detector  by 
an  amount  u'(z)  =  (F/z) U(z) r and  v'(z)  =  (F/z)V(z)r.  Thus  equation  8  becomes 

^xji  (w>v) =  /  J  j  [KMf  ST(xj,yj,z)ST([x  +  u\z))j  ,[y  +  v'(:)]j,z)dxdydz  (9) 

0  —oo — oo 

If  the  spatial  in-plane  cross-correlation  of  the  temperature  field  is  expressed  in  terms  of  the 
temperature  structure  function  and  the  correlation  coefficient  cn,  then  the  above  expression 
becomes 

*,,,  (».v)  =  j[^(A.2)]J  Cl  W(z)jc„([«'(z)]ir,[V(2)]i:,2)  dz  (10) 

o  r  r 

Once  the  temperature  cross-correlations  Cn  are  obtained,  the  in-plane  velocity  components  can 
be  found  from  the  displacement  of  the  correlation  peaks  for  each  location  z. 

The  integrals  in  the  above  equations  are  definite,  and  T(z),  Cr,  and  cjt  appear  only  in  the 
integrand  of  one  side  of  the  equation.  Thus,  equations  4,7,  and  10  are  Fredholm  integral 
equations  of  the  first  kind.  In  a  more  general  form,  this  type  of  equation  is  represented  as 

b 

g(y)  =  $K(y,x)f(x)dx  (11) 

a 

where  g(y)  is  the  given  or  measured  function,  f(x)  is  the  function  being  sought,  and  K(y,x)  is  the 
kernel  or  weighting  function.  In  figure  1,  the  kernels  are  discrete  because  they  contribute  only  at 
one  value  of  x.  Since  only  one  kernel  contributes  to  g(y)  at  a  given  x,  it  is  an  ideal  situation  for 
extracting/fx)  from  a  set  of  measurements. 
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Ideal  Kernel  Contribution 


K1  K2  K3  K4  K5 

- - - - - - X 

xi  X2  X3  X4  X5 

Figure  1:  Ideal,  independent  kernels 

On  the  other  hand,  figure  2  shows  kernel  functions  with  non-discreet  narrow  profiles.  These 
produce  a  slightly  blurred  f(x). 


Kernels  That  Result  in  Slightly  Blurred  f(x) 


XI  X2  X3  X4  X5 

Figure  2:  Close  to  ideal  kernels 


However,  f(x)  can  be  sharpened  with  an  appropriate  inversion  method.  In  real  inversion 
problems,  the  kernels,  such  as  the  ones  in  figure  3,  are  often  non-zero  over  the  entire  range  of  x, 
or  have  smooth  or  common  maximums.  Relative  contributions  of  the  kernels  are  more  difficult  to 
discern,  and  the  kernels  are  more  interdependent.  The  task  of  inversion  becomes  more 
formidable. 
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Figure  3:  Interdependent  kernels 

The  kernels  in  these  may  cause  the  solution  f(x)  to  be  unstable  or  oscillatory  due  to  small  changes 
or  errors  in  the  input.  Hence,  Fredholm  equations  of  the  first  kind  are  often  ill-conditioned,  and 
this  property  greatly  complicates  inversion  problems. 

One  method  for  finding  f(x)  is  by  direct  linear  inversion.  If  there  are  M  discreet  measurements 
and  V  number  of  points  in  the  domain  ofx,  then  a  quadrature  of  equation  1 1  is 

&='LKijfjAx  /  =  1,2,...,  M  (12) 

y=i 

If  Ay  =  K,j  Ax  and  gj  and  f  are  defined  as  vectors  g  and  f  respectively,  then 

Af  =  g  (13) 

f  =  A‘'g  (14) 

Error  due  to  the  quadrature  is  generally  very  small  and  is  not  considered.  The  multiplication  of 
A'1  and  g  will  only  work  when  A  is  square  and  the  number  of  measurements  equals  the  number  of 
rows  in  A.  Furthermore,  A  is  often  singular  or  near  singular  in  ill-conditioned  problems.  Singular 
Value  Decomposition  (SVD),  however,  can  help  to  rectify  these  problems. 
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SVD  can  decompose  A  into  a Mx  N column-orthogonal  matrix  V,  an NxN diagonal  matrix 
W  containing  singular  values,  and  the  transpose  of  an  N  x  N  orthogonal  matrix  V.  After  zeroing 
the  reciprocals  of  the  small  singular  values  to  reduce  round-off  error  in  the  sets  of  equations,  the 


solution  that  will  best  minimize  the  least  squares  residual  is 


(  1  \] 

f  =  V  diag  —  (uTg) 


where  ©j’s  are  the  singular  values1.  SVD  will  only  work  with  M greater  than  or  equal  to  N; 
however,  by  augmenting  A  and  g  with  zeros  so  that  A  becomes  a  square  matrix,  the  process  will 
work  to  give  a  possible  and  useful,  but  never  a  unique  solution. 

Another  method  for  finding/fx)  is  by  direct  non-linear  inversion.  Equation  11  is  no  longer  of 
importance  because  it  is  linear.  A  general  form  of  equation  2  is 

g(y)  =  J  K{y,  x ,  f{x))  dx  (16) 


The  Newton-Raphson  method  for  finding  f(x)  can  be  used  by  letting 


^(/,,/2, •••>/„)=  1 ]KtJ(fj)Ax  -g,  =0  i  =  1,2,..., A/ 


and  if 


then  the  iterative  formula  is 


1 


The  process  is  stopped  when  ||F||  is  small  enough.  To  apply  Newton-Raphson,  M  and  N  must  be 
equal.  J  and  F  can  be  augmented  with  zeros  to  make  M  and  N  equal  if  SVD  is  used. 
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Another  inversion  method  involves  imposing  an  additional  constraint  on  f(x)  to  limit  the  set  of 
possible  solutions.  This  is  called  a  regularization  method.  There  exist  a  handful  of  regularization 
methods,  but  their  general  structure  will  be  explained  below. 

If  there  exists  some  a  priori  knowledge  about  f(x)' s  shape,  then  a  reasonable  constraint  on  f(x) 
might  be  its  measure  of  smoothness.  The  additional  condition  is  arbitrary,  because  the 
measurements  give  no  evidence  that_/(x)  is  smooth. 

Let  r( f)  be  a  positive  scalar  measure  of  the  roughness  of  f.  If  r(l)  is  zero,  then  the  resulting  f  is 
completely  smooth  as  given  by  the  measure  of  r(f).  A  least  squares  criterion  minimizes 
||Af  -  g||,  whereas  a  regularization  method  minimizes  ||Af  -  g||  +  y r(f).  y  is  a  regularization 
parameter  that  can  be  varied  from  zero  to  infinity.  As  y  approaches  infinity,  the  minimization  goes 
to  r( f)  =  0  regardless  of  g.  In  a  purely  least  squares  problem,  y  equals  zero,  and  the  solution 
depends  completely  on  g.  Therefore,  y  is  a  parameter  that  can  be  arbitrarily  set  as  a  trade-off 
between  smoothness  and  agreement  with  the  measurements. 

Most  measures  of  smoothness  are  simple  quadratic  combinations  of  the  f, .  The  measures  of 
smoothness  are  the  sums  of  squares  of  differences  computed  at  every  /'.  Let  B  be  a  matrix  such 
that  Bf  contains  in  its  elements  values  that  are  squared  and  summed  to  give  r(f ).  Suppose  it  is  an 
a  priori  belief  that  f  is  a  second  degree  polynomial,  then  the  third  derivative  of  f ’s  components 
should  equal  zero.  If  Bf  is  to  have  elements  proportional  to  a  third  derivative,  then  B  must  be  the 
third  difference  matrix. 


-1  3  —3  1  0  0  0  •••  0 

0-13-31  0  0  •••  0 

0  •••  0  0  -1  3  -3  1  0 

0  •••  0  0  0  -1  3  -3  1 


(20) 
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In  the  case  of  inversion  for  atmospheric  temperature,  there  is  sufficient  statistical  data  in 
MODTRAN  or  other  sources  to  provide  average  temperature  profiles  which  can  be  used  for 
difference  operators  of  B.  In  this  case,  not  all  the  rows  in  B  would  be  the  same,  but  would 
depend  upon  the  fit  to  the  MODTRAN  data. 

The  quadratic  measure  q  of  Z(-/  +  3/+l-  3f+2  +  f,+2)2  is  given  by  the  square  of  the  norm  of 

Bf 


q(f)  =  (Bf)TBf  =  f  TBTBf  (21) 

Suppose  L  =  BtB,  then  the  minimization  is  of  the  quantity  (Af-g)T(Af-g)  +  yfTLf  This  is  a 
constrained  extremum  problem  with  y  as  an  undetermined  Lagrangian  multiplier.  For  all  /', 

4r  (f  TATAf  -  gTAf  -  f  TATg  +  y  f  TLf )  =  0  (22) 

0i 

which  simplifies  to 

f  =  (ATA  +  yL)-'ATg  (23) 

For  regularization  methods,  equation  23  is  not  usually  solved  by  inverting  the  coefficient 
matrix,  but  by  using  SVD,  or  LU  factorization  if  the  ill-conditioning  has  been  removed  by 
changing  to  a  suitable  basis  through  the  introduction  of  the  side  constraint. 

Within  the  realm  of  regularization  methods,  several  were  compared  for  the  solution  of 
atmospheric  parameters  from  simulated  radiance  measurements.  One  method  called  Tikhonov- 
Miller  regularization  is  simply  a  solution  of  equation  23  with  a  conventional  numerical  technique 
such  as  LU  factorization  or  SVD.  Two  methods  called  Truncated  Singular  Value  Decomposition 
(TSVD)  and  Damped  Singular  Value  Decomposition  (DSVD)  solve  equation  23  in  parts  by 
minimizing  the  roughness  term  subject  to  minimizing  the  residual  term. 
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In  addition,  these  two  methods  derive  a  new  problem  with  a  well-conditioned  rank  deficient 
coefficient  matrix2.  For  TSVD  and  DSVD,  a  truncation  parameter  k  plays  an  similar  role  to  y. 

For  TSVD;  the  rank  deficient  matrix  is  achieved  by  truncation  of  A.  By  truncating  the  SVD 
expansion,  the  smaller  singular  values  (the  ones  beyond  the  cutoff  at  k)  are  set  to  zero  thereby 
reducing  the  sensitivity  of  the  solution  to  the  input.  The  truncated  SVD  expansion  is 

k 

A*=I >/w,vf  (24) 

i=l 

where  k  <  N  and  u,  w,  and  v  are  elements  of  the  matrices  of  the  SVD.  DSVD  is  slightly  more 
sophisticated  than  TSVD  because  it  zeroes  the  smaller  singular  values  of  the  SVD  expansion  by 
smoothing  the  cutoff  point  at  k.  Thus  equation  1 5  would  be  multiplied  by  appropriate  filter 
factors  that  gradually  zero  out  smaller  singular  values. 

A  fundamental  problem  of  regularization  methods  is  their  dependence  upon  choosing  a  suitable 
parameter  for  providing  a  fair  balance  between  both  parts  of  the  minimization.  The  quality  of  the 
solution  depends  heavily  upon  the  choice  of  a  regularization  parameter,  and  there  are  several 
techniques  for  prescribing  a  reasonable  choice.  These  techniques  in  conjunction  with  the 
aforementioned  regularization  techniques  were  considered.  The  techniques  are  called  Generalized 
Cross-Validation  (GCV),  L-Curve  criterion,  the  Quasi-Optimality  criterion,  and  the  Discrepancy 
principle.  These  techniques  inspect  the  spectrum  of  regularized  solutions  as  a  function  of  the 
regularization  parameter.  Each  one  uses  different  functions,  but  all  suggest  using  parameters  that 
correspond  to  unique  points  on  the  function  curve. 

As  stated  by  Hansen2,  Generalized  Cross-Validation  works  on  the  knowledge  that  if  an 
arbitrary  element  of  g  is  removed,  then  the  resulting  solution  should  predict  this  well,  and  a 
selected  regularization  parameter  should  be  independent  of  an  orthogonal  transformation  of  g. 
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GCV(y )  =  l|Af(/).  (25) 

(trace(  I  -  AA1))2 

is  the  function  given  by  Hansen  where  A1  (related  to  the  pseudo-inverse)  is  a  matrix  which 
produces  the  solution  f(y)  when  multiplied  with  g.  The  y  suggested  by  this  approach  is  the  one 
that  minimizes  GCV( y). 

The  L-curve  criterion  computes  the  curvature  of  the  curve  (log  ||Af(y)-g||2,  log  ||Lf(y)||),  where 
y  varies  the  position  along  the  curve.  The  curve  is  generally  L  shaped  because  of  its  appearance 
on  a  log  scale  and  because  the  two  halves  of  the  regularization  are  separately  sensitive  to  changes 
in  y,  producing  a  point,  or  a  value  of  y,  where  there  is  a  fair  balance  between  the  two  different 
types  of  error.  It  is  at  this  y  where  there  is  maximum  curvature,  and  the  L-Curve  criterion 
suggests  using  that  y  for  the  regularization  problem. 

The  Quasi-Optimality  criterion  minimizes  the  function 


Q=r 


kf0')ll 


This  method,  like  the  L-Curve  criterion,  attempts  to  balance  residual  error  and  the  side  constraint 
error,  but  the  procedure  is  different4. 

The  Discrepancy  principle  suggests  choosing  a  regularization  parameter  so  that  the  residual 
norm  is  below  an  error  estimate  when  one  is  known.  In  the  case  of  radiance  measurements, 
radiometer  devices  typically  have  associated  with  them  a  noise  level  that  can  be  used  to  make  this 
error  estimate.  The  regularization  using  the  Discrepancy  principle  is  a  least  squares  problem  with 
the  error  constraint. 


3.  Initial  Simulations 
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There  are  many  techniques  to  choose  from  when  consulting  the  literature  on  inversion  methods. 
The  results  can  be  vastly  different  when  a  greater  number  of  methods  are  considered,  and  many  of 
these  methods  are  based  on  subjective  criteria.  Hence,  numerical  testing  of  multiple  methods  is 
essential  for  finding  an  optimal  one  to  solve  the  particular  problem  at  hand. 

The  inversion  methods  mentioned  above  were  applied  to  the  discreetized  versions  of  the 
radiance  equations  and  evaluated  by  numerical  simulation.  An  initial  set  of  simulations  for  the 
inversion  for  temperature  profiles  were  done  to  compare  the  traditional  numerical  linear  algebra 
techniques  to  that  of  regularization.  Since  Tikhonov-Miller  is  representative  of  most  regularization 
methods,  it  was  chosen  as  a  comparison  to  direct  linear  inversion  and  non-linear,  iterative, 
inversion. 

The  initial  simulations  assume  that  the  measurements  are  taken  with  a  radiometer  having  25 
wavelength  channels,  a  bandwidth  of  0.05%  of  the  wavelength,  a  focal  length  of  400  millimeters, 
and  a  detector  width  of  5  millimeters.  A  exponential  function  was  used  to  simulate  a  plausible  line 
shape  for  each  wavelength.  The  absorption  data  comes  from  a  MODTRAN  database  assuming  a 
US  standard  atmosphere.  Twenty-five  equally  spaced  wavelengths  in  the  range  of  9.65  to  9.90 
microns  are  used  in  the  simulations.  These  wavelengths  were  selected  subjectively  from  contour 
plots  showing  weighting  functions  for  the  entire  spectrum.  Wavelengths  were  selected  such  that 
their  weighting  functions  were  as  dissimilar  as  possible.  The  weighting  functions  for  wavelengths 
in  the  range  of  9.65  to  9.90  microns  appeared  to  vary  sufficiently  to  warrant  their  selection. 
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Arbitrary  Selection  of  Weighting  Function* 


Figure  4:  Weighting  functions  for  9.65  to  9.90  microns 

A  plot  of  the  weighting  functions  for  these  wavelengths  is  shown  in  figure  4.  Because  the  goal  of 
initial  set  of  simulations  was  to  compare  classes  of  inversion  techniques,  optimal  selection  of 
weighting  functions  was  not  a  key  factor.  Simulations  described  later  use  a  more  objective 
approach.  The  arbitrary  parabolic  temperature  profile  shown  in  figure  5  is  used  to  generate  the 
intensity  measurements.  The  assumed  temperature  profile  is  a  parabola  as  well. 


Vertical  Atmospheric  Temperature  Profiles 


Figure  5:  Arbitrary  temperature  profiles 
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6  km  of  vertical  atmosphere  are  considered  in  measurement  generation  and  inversion.  The 
resolution  sought  for  the  temperature  profile  is  20  m.  Since  there  are  more  unknowns  than 
equations,  the  problem  is  certainly  not  injective  (A  has  a  trivial  null  space  so  there  is  no  unique 
solution),  but  this  situation  is  ‘remedied’  by  the  use  of  SVD. 

There  are  three  signal  to  noise  ratios,  SNR,  that  are  compared  in  the  preliminary  simulations: 
100,  20,  and  10.  Comparing  the  effects  of  noise  levels  is  of  particular  interest  because  this  is  an 
ill-conditioned  problem.  The  solution  may  be  unstable  or  oscillatory  due  to  small  changes  or 
errors  in  the  input.  Therefore,  to  evaluate  the  initial  results,  the  following  indicators  of  quality  of 
solution  are  calculated  for  each  inversion  method.  The  absolute  error,  s,  represents  a  measure  of 
the  accuracy  of  the  calculated  profiles. 

The  bias,  B,  is  the  difference  between  the  average  and  the  true  values.  The  temperature  profile 
calculated  with  SNR  =  oo,  T°,  is  used  as  the  “average  value”  in  these  simulations. 


The  standard  deviation,  S,  of  the  calculated  temperatures  around  the  biased  temperature  profile 
T°  represents  a  measure  of  the  precision  of  the  calculated  profiles. 


Equations  27-29  are  applied  to  all  three  inversion  methods  in  the  initial  simulations.  The 
Tikhonov-Miller  method,  however,  depends  on  the  regularization  parameter  y,  so  values  for 
equations  27-29  in  this  case  must  be  computed  as  a  function  of  y. 
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The  direct  linear  inversion  method  produces  the  temperature  profiles  seen  in  figure  6  for  the 
four  signal  to  noise  ratios.  There  are  prominent  oscillations  in  the  output  around  the  actual 
temperature  profile.  As  expected,  the  amplitude  of  these  oscillations  increases  as  the  signal  to 
noise  ratio  decreases. 


Direct  Linear  inversion  of  Temperature 


Figure  6:  Direct  inversion  using  SVD 

The  direct  non-linear  inversion  method  shown  in  figure  7  exhibits  much  smaller  oscillations  than 
the  linear  method.  Note  the  scale  of  the  graph  in  figure  6  is  much  larger  than  the  scale  of  the 
graph  in  figure  7.  The  oscillations  from  the  direct  method  indicate  that  the  eigenvalues  of  the 
kernel  matrix  are  too  small  for  the  solution  to  be  stable.  The  high  degree  of  kernel 
interdependence  results  in  small  eigenvalues.  Although  SVD  can  help  invert  the  kernel  matrix 
by  finding  the  singular  values,  small  changes  in  the  measurements  still  result  in  large  changes  in 
the  solution  because  they  are  multiplied  by  large  eigenvalues  in  the  inverse  kernel  matrix. 
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Direct  Nonlinear  Inversion  for  Temperature 


Figure  7:  Non-linear  inversion  using  Newton-Raphson 


Table  1  gives  the  values  for  equations  27-29  generated  from  the  direct  methods.  The  standard 
deviations  show  the  non-linear  method  is  more  precise,  and  the  increase  in  precision  error  with 
SNR  decrease  is  greater  using  the  linear  method. 


Table  1:  Root  Mean  Squared  Values  from  Direct  Inversion  Methods 


R.M.S.  Value 

Linear 
SNR  =  100 

Non-Linear 
SNR  =  100 

Linear 
SNR  =  20 

Non-Linear 
SNR  =  20 

Linear 
SNR  =  10 

Non-Linear 
SNR  =  10 

Bias  Error  (K) 

4.63 

0.11 

4.63 

0.11 

4.63 

0.11 

Standard  Deviation  (K) 

17.04 

1.33 

18.30 

1.93 

21.77 

2.64 

Absolute  Error  (K) 

18.54 

1.33 

19.70 

1.93 

22.95 

2.64 

The  same  is  true  regarding  each  method’s  accuracy,  given  by  the  absolute  error.  Also,  the  non¬ 
linear  method  shows  equal  standard  deviation  and  absolute  error  because  of  its  low  bias  error. 
Hence,  the  direct  non-linear  method  is  more  accurate  and  more  precise  than  the  direct  linear 
method,  but  each  method  is  susceptible  to  producing  oscillations  from  noise. 

The  root  mean  squared  values  must  be  computed  as  a  function  of  the  regularization  parameter 
for  the  Tikhonov-Miller  method.  The  simulations  computed  values  for  equations  27-29  with  y 
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ranging  from  10"6  to  1010.  In  figure  8,  the  bias  error  for  each  noise  level  has  a  minimum  value 
associated  with  y.  The  minimum  values  are  all  approximately  zero,  and  each  noise  level  exhibits 
two  plateaus  of  bias  error,  one  at  4.75  K,  and  one  around  2  K. 


Bias  for  Regularization  Method 


Regularization  Parameter 

Figure  8:  Bias  errors  for  Tikhonov  regularization 
After  the  minimum  values,  the  bias  errors  increase  rapidly  to  extremely  large  values.  The 
standard  deviations,  shown  in  figure  9,  demonstrate  the  effects  of  regularization  on  the  inversion. 

Standard  Deviation  for  Regularization  Method 


Regularization  Parameter 

Figure  9:  Standard  deviations  for  Tikhonov  regularization 
The  standard  deviations  range  from  initial  values  of  1 3.8  K  to  final  values  of  0  K.  The  standard 
deviations  become  zero  because  y  becomes  large,  so  the  minimization  is  primarily  on  the 
smoothing  functional.  The  smoothing  functional  does  not  depend  on  the  measurements,  so  there 
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is  no  difference  between  the  temperature  profiles  with  and  without  the  noise.  The  standard 
deviations  are  high  initially  because  y  is  small  and  the  emphasis  of  the  minimization  is  instead  on 
the  residual.  The  absolute  error  plots  in  figure  10  are  much  like  the  bias  plots;  they  have 
minimums  at  about  the  same  regularization  parameters  and  increase  rapidly  after  the  minimums. 


Absolute  Error  for  Regularization  Method 


Figure  10:  Absolute  error  for  Tikhonov  regularization 
However,  there  is  a  sharp  dip  in  each  error  before  the  minimums,  unlike  the  plateaus  of  the  bias 
errors.  The  errors  given  in  table  2  are  almost  the  same  as  those  from  the  direct  non-linear 
inversion. 


Table  2:  Values  at  Minimum  Absolute  Error  for  Regularization  Method 


Values  at  Minimum  Absolute  Error 

SNR  =  100 

SNR  =  20 

SNR  =  10 

Bias  Error  (K) 

0.09 

0.12 

0.09 

Standard  Deviation  (K) 

0.30 

1.02 

2.99 

Absolute  Error  (K) 

0.26 

1.03 

2.94 

Regularization  Parameter 

4E6 

7E4 

4E4 

In  a  real  situation,  the  difficulty  of  picking  y  so  that  the  error  happens  to  be  at  the  minimum 
absolute  error  is  a  disadvantage  of  a  regularization  method,  because  the  absolute  error  is  a 
function  of  the  unknown  true  solution.  The  simulations  shown  later  will  demonstrate  the  effect 
of  the  different  ‘objective’  techniques  for  choosing  an  optimal  regularization  parameter. 


15-21 


The  regularization  produces  the  temperature  profiles  in  figure  1 1  when  y  is  such  that  there  is 
minimum  absolute  error.  Compared  to  the  direct  inversion  methods,  there  is  no  instability 
inherent  in  the  profiles.  This  is  the  case  at  any  value  of  y  above  about  0. 1 . 

Regularized  Linear  Inversion  of  Temperature 


0  1  2  3  4  5  6 

Altitude  (km) 


Figure  1 1 :  Tikhonov  regularization  for  temperature  profiles 
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4.  Optimal  Wavelength  Selection 

The  problem  of  choosing  optimal  wavelengths  such  that  the  weighting  functions  are  least 
interdependent  amounts  to  an  optimization  problem  of  the  determinant.  Which  thirty  or  so 
wavelengths  taken  in  aggregate  out  of  about  five  thousand  possible  spectral  lines  produce  a 
coefficient  matrix  with  the  highest  determinant?  To  answer  this  question,  Applied  Statistics 
algorithm  AS295  was  used  as  an  optimization  routine5.  This  algorithm  uses  a  Fedorov  exchange 
routine  to  exchange  candidates  with  existing  selections  until  no  further  increase  in  the 
determinant  can  be  obtained.  The  normalized  weighting  functions  for  one  particular  simulation 
are  shown  in  figure  12.  The  routine  selects  a  spread  of  weighting  functions.  The  centroids  of 
these  weighting  functions  are  dispersed  showing  how  the  row  vectors  span  the  measurement 
space  as  best  as  they  can  from  the  candidate  weighting  functions.  The  rapid  decay  in  the 
functions  shows  the  heavy  attenuation  by  the  atmosphere.  This  set  is  far  from  the  ideal  situation 
as  shown  in  figure  1,  but  any  attempt  to  improve  the  conditioning  of  a  problem  such  as  this  is 
worth  the  effort.  The  optimization  of  wavelengths  procedure  was  used  on  the  next  set  of 
simulations. 
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5.  Final  Simulations 


Since  it  was  found  that  a  regularization  method  is  more  suited  to  this  type  of  problem, 
Tikhonov-Miller,  TSVD,  and  DSVD  regularization  were  compared  in  simulations  using  the  L- 
Curve,  GCV,  and  Quasi-Optimality  parameter  selection  methods.  The  Discrepancy  principle, 
which  has  its  own  regularization  procedure,  was  also  compared.  These  simulations  solve  for  the 
temperature  structure  parameter,  CT ,  as  well.  In  order  to  facilitate  comparison  and  plotting  of 
multiple  regularization  methods,  the  following  simulations  used  the  Matlab  tools  developed  by 
Hansen  by  integrating  them  with  the  existing  C  code  .  Several  cases  were  examined  where  the 
imaging  volume’s  optic  axis  is  parallel  to  the  ground.  In  this  situation,  the  absorption  coefficient 
in  the  radiance  equations  does  not  change  with  z.  The  simulations  used  a  signal  to  noise  ratio  of 
100,  a  bandwidth  of  .05%  of  the  wavelength,  a  focal  length  of  400  millimeters,  and  a  detector 
width  of  5  millimeters.  Except  for  the  SNR,  these  parameters  do  not  significantly  affect  the 
quality  of  the  results.  These  simulations  only  consider  about  one  kilometer  of  atmosphere  in 
order  to  reduce  the  number  of  unknowns.  This  was  done  because  of  the  inaccurate  results 
obtained  for  the  temperature  structure  parameter,  as  shown  later.  If  the  solution  methods  for 
cannot  find  CT  with  25  unknowns,  than  they  certainly  will  not  with  300.  Twenty-five 
wavelengths  are  used  from  the  output  of  the  Fedorov  exchange  algorithm.  No  prevalent 
wavelength  range  was  noted;  the  wavelengths  spanned  the  spectrum  from  2  to  1 8  microns.  An 
equal  number  of  atmospheric  slices  and  wavelengths  were  used.  Arbitrary  profiles  such  as  the 
ones  shown  in  the  plots  were  used  for  generation  of  the  radiance  measurements. 
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The  results  shown  by  the  plots  below  indicate  Quasi-Optimality  criterion  to  be  a  clear  cut 
winner  for  regularization  parameter  selection  for  this  problem.  As  stated  before,  there  is  no 
theoretical  reason  as  to  why  one  or  more  of  these  methods  would  prevail  over  another,  but  from 
simulation  to  simulation,  the  Quasi-Optimality  criterion  consistently  picks  a  parameter  which 
gives  more  accurate  results  than  the  other  methods.  Figures  13-16  show  results  for  the  vertical 
case.  Inversions  for  temperature  shown  in  figures  13,14,  and  16  have  high  amplitude  oscillations 
because  the  parameter  selection  method  did  not  pick  an  optimal  y. 
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The  inversions  for  temperature  using  Quasi-Optimality  produce  solution  profiles  with  root 
mean  squared  errors  no  higher  than  1.5  K  for  assumed  profile  average  deviations  as  high  as  20  K. 
The  structure  parameter  profiles  are  better  for  the  L-Curve  and  Discrepancy  criterion,  but  even 
using  the  Quasi-Optimality  criterion  the  results  are  on  average  300  percent  off  from  the  actual 
solution.  The  attempt  to  resolve  information  from  small,  similar  radiance  fluctuations  is  even 
more  ill-conditioned  than  the  inversions  for  temperature.  Attempts  to  produce  larger  temperature 
fluctuations  in  the  simulations  by  increasing  the  actual  structure  parameter  magnitude  yields 
profiles  with  different  numbers  but  the  same  trends.  As  shown,  the  Tikhonov,  TSVD,  and 
DSVD  methods  each  perform  approximately  the  same,  so  there  is  no  advantage  to  using  one  over 
another  for  this  problem.  The  horizontal  case  simulations  shown  in  figures  17-20  show  similar 
findings  about  the  regularization  methods  and  parameter  selection  techniques. 

The  solution  for  correlation  coefficients  in  equation  10  relies  on  accurate  temperature 
structure  parameter  profiles.  Because  of  the  inaccurate  structure  parameter  results  in  all  cases,  no 
attempt  was  made  to  invert  correlations  of  radiance  images  to  find  velocity  components. 

4.  Conclusions 

A  regularization  method  is  better  suited  to  handle  the  ill-conditioned  nature  of  atmospheric 
radiance  inversion  problems  than  a  direct  inversion  method.  The  direct  linear  method  is 
unacceptable.  The  results  show  high  magnitudes  of  error  and  large  oscillations.  The  direct  non¬ 
linear  method  succeeds  in  reducing  error.  Nonetheless,  the  output  is  still  sensitive  to  small 
changes  or  errors  in  the  measurements.  A  regularization  method,  however,  employs  an  a  priori 
constraint  of  smoothness  on  the  solution  that  reduces  or  eliminates  the  instability  with  a  correctly 
chosen  regularization  parameter. 
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All  regularization  techniques  presented  were  tested  on  the  problem  of  finding  atmospheric 
temperature  profiles  and  temperature  structure  parameter  profiles.  Regularization  parameter 
selection  techniques  were  compared  for  each  of  the  regularization  methods.  The  Quasi- 
Optimality  criterion  consistently  gives  more  accurate  results  than  the  other  three  selection 
procedures:  L-Curve  criterion,  the  Discrepancy  principle,  and  Generalized  Cross  Validation. 
Temperature  solutions  using  Quasi-Optimality  criterion  were  within  1.5  K  whereas  other 
parameter  selection  techniques  gave  unstable  results.  Temperature  structure  parameter  profiles 
also  benefit  the  most  from  the  Quasi-Optimality  criterion,  but  lack  in  accuracy  by  about  300 
percent  in  the  best  case.  Therefore,  inversions  for  velocity  would  be  futile.  From  the  methods 
tested,  it  appears  that  solutions  for  atmospheric  turbulence  and  velocity  from  passive  radiance 
measurements  by  means  of  a  radiometer  device  are  unfeasible. 
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